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GENE EXPRESSION PROFILING OF EGFR POSITIVE CANCER 

Background of the Invention 

Field of the Invention 

The present invention concerns gene expression profiling of tissue samples obtained from 

5 EGFR-positive cancer. More specifically, the invention provides diagnostic, prognostic and 
predictive methods based on the molecular characterization of gene expression in paraffin- 
embedded, fixed tissue samples of EGFR-expressing cancer, which allow a physician to predict 
whether a patient is likely to respond well to treatment with an EGFR inhibitor. In addition, the 
present invention provides treatment methods based on such findings. 

10 Description of the Related Art 

Oncologists have a number of treatment options available to them, including different 
combinations of chemotherapeutic drugs that are characterized as "standard of care," and a 
number of drugs that do not carry a label claim for particular cancer, but for which there is 
evidence of efficacy in that cancer. Best likelihood of good treatment outcome requires that 

15 patients be assigned to optimal available cancer treatment, and that this assignment be made as 
quickly as possible following diagnosis. 

Currently, diagnostic tests used in clinical practice are single analyte, and therefore do 
not capture the potential value of knowing relationships between dozens of different markers. 
Moreover, diagnostic tests are frequently not quantitative, relying on immunohistochemistry. 

20 This method often yields different results in different laboratories, in part because the reagents 
are not standardized, and in part because the interpretations are subjective and cannot be easily 
quantified. RNA-based tests have not often been used because of the problem of RNA 
degradation over time and the fact that it is difficult to obtain fresh tissue samples from patients 
for analysis. Fixed paraffin-embedded tissue is more readily available and methods have been 

25 established to detect RNA in fixed tissue. However, these methods typically do not allow for the 
study of large numbers of genes (DNA or RNA) from small amounts of material. Thus, 
traditionally fixed tissue has been rarely used other than for immunohistochemistry detection of 
proteins. 

Recently, several groups have published studies concerning the classification of various 
30 cancer types by microarray gene expression analysis (see, e.g. Golub et ah, Science 286:531-537 
(1999); Bhattacharjae et al, Proc. Natl Acad Set USA 98:13790-13795 (2001); Chen-Hsiang et 
al, Bioinformatics 17 (Suppl. 1):S316-S322 (2001); Ramaswamy et al, Proa Natl Acad. Set 
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USA 98:15149-15154 (2001)). Certain classifications of human breast cancers based on gene 
expression patterns have also been reported (Martin et aL, Cancer Res. 60:2232-2238 (2000); 
West et aL, Proc. Natl Acad. Set USA 98:11462-11467 (2001); Sorlie et aL, Proc. Natl Acad 
Sci. USA 98:10869-10874 (2001); Yan et aL, Cancer Res. 61:8375-8380 (2001)). However, 
5 these studies mostly focus on improving and refining the already established classification of 
various types of cancer, including breast cancer, and generally do not link the findings to 
treatment strategies in order to improve the clinical outcome of cancer therapy. 

Although modern molecular biology and biochemistry have revealed more than 100 
genes whose activities influence the behavior of tumor cells, state of their differentiation, and 

10 their sensitivity or resistance to certain therapeutic drugs, with a few exceptions, the status of 
these genes has not been exploited for the purpose of routinely making clinical decisions about 
drug treatments. One notable exception is the use of estrogen receptor (ER) protein expression 
in breast carcinomas to select patients to treatment with anti-estrogen drugs, such as tamoxifen. 
Another exceptional example is the use of ErbB2 (Her2) protein expression in breast carcinomas 

15 to select patients with the Her2 antagonist drug Herceptin® (Genentech, Inc., South San 
Francisco, CA). 

Despite recent advances, the challenge of cancer treatment remains to target specific 
treatment regimens to pathogenically distinct tumor types, and ultimately personalize tumor 
treatment in order to optimize outcome. Hence, a need exists for tests that simultaneously 
20 provide predictive information about patient responses to the variety of treatment options.. 
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Summary of the Invention 
The present invention is based on findings of Phase II clinical studies of gene expression 
in tissue samples obtained from EGFR-expressing head and neck cancer or colon cancer of 
human patients who responded well or did not respond to (showed resistance to) treatment with 
5 EGFR inhibitors. 

Based upon such findings, in one aspect the present invention concerns a method for 

predicting the likelihood that a patient diagnosed with an EGFR-expressing cancer will respond 

to treatment with an EGFR inhibitor, comprising determining the expression level of one or more 

prognostic RNA transcripts or their products in a sample comprising EGFR-expressing cancer 
10 cells obtained from the patient, wherein the prognostic transcript is the transcript of one or more 

genes selected from the group consisting of: Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CD105; 

CD44s; CD82; CD9; CGA;; CTSL; EGFRd27; ErbB3; EREG; GPC3; GUS; HGF; ID1; 

IGFBP3; ITGB3; ITGB3; p27; P53; PTPD1; RBI; RPLPO; STK15; SURV; TERC; TGFBR2; 

TIMP2; TITF1; XIAP; YB-1; A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CA9; 
15 CCNA2; CCNE1; CCNE2; CD134; CD44E; CD44v3; CD44v6; CD68; CDC25B; CEACAM6; 

Chk2; cMet; COX2; cripto; DCR3; DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; 

ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; 

MTA1; NMYC; P14ARF; PAD; PDGFA; PDGFB; PGK1; PLAUR; PPARG; RANBP2; 

RASSF1; RIZ1; SPRY2; Src; TFRC; TP53BP1;UPA; and VEGFC, wherein (a) the patient is 
20 unlikely to benefit from treatment with an EGFR inhibitor if the normalized levels of any of the 

following genes A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CA9; CCNA2; CCNE1; 

CCNE2; CD134; CD44E; CD44v3; CD44v6; CD68; CDC25B; CEACAM6; Chk2; cMet; 

COX2; cripto; DCR3; DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; 

ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; 
25 NMYC; P14ARF; PAD; PDGFA; PDGFB; PGK1; PLAUR; PPARG; RANBP2; RASSF1; 

RIZ1; SPRY2; Src; TFRC; TP53BP1; upa; VEGFC, or their products are elevated above defined 

expression thresholds, .and (b) the patient is likely to benefit from treatment with an EGFR 

inhibitor if the normalized levels of any of the following genes Bak; Bclx; BRAF; BRK; Cadl7; 

CCND3; CD105; CD44s; CD82; CD9; CGA;; CTSL; EGFRd27; ErbB3; EREG; GPC3; GUS; 
30 HGF; ID1; IGFBP3; ITGB3; ITGB3; p27; P53; PTPD1; RBI; RPLPO; STK15; SURV; TERC; 

TGFBR2; TIMP2; TITF1; XIAP; and YB-1, or their products are elevated above defined 

expression thresholds. 

In another aspect, the present invention concerns a prognostic method comprising 
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(a) subjecting a sample comprising EGFR-expressing cancer cells obtained from a 
patient to quantitative analysis of the expression level of at least one gene selected from the 
group consisting of CD44v3; CD44v6; DR5; GROl; KRT17; and LAMC2 gene or their 
products, and 

(b) identifying the patient as likely to show resistance to treatment with an EGFR- 
inhibitor if the expression levels of such gene or genes, or their products, are elevated above a 
defined threshold. In a particular embodiment, the gene is LAMC2. 

In yet another aspect, the invention concerns a method for predicting the likelihood that a 
patient diagnosed with an EGFR-expressing head or neck cancer will respond to treatment with 
an EGFR inhibitor, comprising determining the expression level of one or more prognostic RNA 
transcripts or their products in a sample comprising EGFR-expressing cancer cells obtained from 
5 such patient, wherein the prognostic transcript is the transcript of one or more genes selected 
from the group consisting of: CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; 
TIMP2; YB-1; A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CCNA2; CCNE1; 
CCNE2; CD105; CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; 
DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; 

10 HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; NMYC; PAI1; PDGFA; 
PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; and VEGFC, wherein (a) normalized expression 
of one or more of A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CCNA2; CCNE1; 
CCNE2; CD105; CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; 
DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; 

15 HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; NMYC; PAI1; PDGFA; 
PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; VEGFC, or the corresponding gene product, 
above determined expression thresholds indicates that the patient is likely to show resistance to 
treatment with an EGFR inhibitor, and (b) normalized expression of one or more ofCD44s; 
CD82; CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; TIMP2; YB-1, or the corresponding 

20 gene product, above defined expression thresholds indicates that the patient is likely to respond 
well to treatment with an EGFR inhibitor. 

In a further aspect, the invention concerns a method for predicting the likelihood that a 
patient diagnosed with an EGFR-expressing colon cancer will respond to treatment with an 
EGFR inhibitor, comprising determining the expression level of one or more prognostic RNA 

25 transcripts or their products in a sample comprising EGFR-expressing cancer cells obtained from 
the patient, wherein the prognostic transcript is the transcript of one or more genes selected from 
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the group consisting of Bak; Bclx; BRAF; BRK; Cadi 7; CCND3; CCNE1; CCNE2; CD105; 
CD9; COX2; DIABLO; ErbB3; EREG; FRP1; GPC3; GUS; HER2; HGF; DDI; ITGB3; PTPD1; 
RPLPO; STK15; SURV; TERC; TGFBR2; TITFl; XIAP; CA9; CD134; CD44E; CD44v3; 
CD44v6; CDC25B; CGA; DR5; GROl ; KRT17; LAMC2; P14ARF; PDGFB; PLAUR; PPARG; 
5 RASSF1; RIZ1; Src; TFRC; and UP A, wherein (a) elevated expression of one or more of CA9; 
CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; DR5; GROl; KRT17; LAMC2; P14ARF; 
PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; TFRC; and UP A, or the corresponding gene 
product, above defined expression thresholds indicates that the patient is likely to show 
resistance to treatment with an EGFR inhibitor, and normalized expression of one or more of 
10 Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CCNE1; CCNE2; CD105; CD9; COX2; DIABLO; 
ErbB3; EREG; FRP1; GPC3; GUS; HER2; HGF; ID1; ITGB3; PTPD1; RPLPO; STK15; 
SURV; TERC; TGFBR2; TITFl; XIAP, or the corresponding gene product, above certain 
expression thresholds indicates that the patient is likely to respond well to treatment with an 
EGFR inhibitor. 

15 In another aspect, the invention concerns a method comprising treating a patient 

diagnosed with an EGFR-expressing cancer and determined to have elevated normalized levels 

of one or more of the RNA transcripts of Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CD105; 

CD44s; CD82; CD9; CGA;; CTSL; EGFRd27; ErbB3; EREG; GPC3; GUS; HGF; ID1; 

IGFBP3; ITGB3; ITGB3; p27; P53; PTPD1; RBI; RPLPO; STK15; SURV; TERC; TGFBR2; 
20 TIMP2; TITFl; XIAP; YB-1; A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CA9; 

CCNA2; CCNE1; CCNE2; CD134; CD44E; CD44v3; CD44v6; CD68; CDC25B; CEACAM6; 

Chk2; cMet; COX2; cripto; DCR3; DIABLO; DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; 

ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; 

MTA1; NMYC; P14ARF; PAH; PDGFA; PDGFB; PGK1; PLAUR; PPARG; RANBP2; 
25 RASSF1; RIZ1; SPRY2; Src; TFRC; TP53BP1; UP A; and VEGFC genes, or the corresponding 

gene products in the cancer, with an effective amount of an EGFR-innibitor, wherein elevated 

RNA transcript level is defined by a defined expression threshold. 

In yet another aspect, the invention concerns a method comprising treating a patient 

diagnosed with an EGFR-expressing head or neck cancer and determined to have elevated 
30 normalized expression of one or more of the RNA transcripts of CD44s; CD 82; CGA; CTSL; 

EGFRd27; IGFBP3; p27; P53; RBI; TIMP2; YB-1; A-Catenin; AKT1; AKT2; APC; Bax; B- 

Catenin; BTC; CCNA2; CCNE1; CCNE2; CD105; CD44v3; CD44v6; CD68; CEACAM6; 

Chk2; cMet; COX2; cripto; DCR3; DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; 
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ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; 
MTA1; NMYC; PAI1; PDGFA; PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; VEGFC genes, 
or the corresponding gene products in said cancer, with an effective amount of an EGFR- 
inhibitor, wherein elevated normalized RNA transcript level is defined by a defined expression 
5 threshold. 

In a further aspect, the invention concerns a method comprising treating a patient 
diagnosed with an EGFR-expressing colon cancer and detennined to have elevated normalized 
expression of one or more of the RNA transcripts of Bak; Bclx; BRAF; BRK; Cadi 7; CCND3; 
CCNE1; CCNE2; CD105; CD9; COX2; DIABLO; ErbB3; EREG; FRP1; GPC3; GUS; HER2; 
10 HGF; ID1; ITGB3; PTPD1; RPLPO; STK15; SURV; TERC; TGFBR2; TITF1; XIAP; CA9; 
CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; DR5; GROl; KRT17; LAMC2; P14ARF; 
PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; TFRC; UPA genes, or the corresponding gene 
products in such cancer, with an effective amount of an EGFR-inhibitor, wherein elevated 
normalized RNA transcript level is defined by a defined expression threshold. 
15 The invention further concerns an array comprising (a) polynucleotides hybridizing to the 

following genes: Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CD105; CD44s; CD82; CD9; CGA;; 
CTSL; EGFRd27; ErbB3; EREG; GPC3; GUS; HGF; ID1; IGFBP3; ITGB3; ITGB3; p27; P53; 
PTPD1; RBI; RPLPO; STK15; SURV; TERC; TGFBR2; TIMP2; TITF1; XIAP; YB-1; A- 
Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CA9; CCNA2; CCNE1; CCNE2; CD134; 
20 CD44E; CD44v3; CD44v6; CD68; CDC25B; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; 
DIABLO; DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; 
HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; NMYC; P14ARF; PAH; 
PDGFA; PDGFB; PGK1; PLAUR; PPARG; RANBP2; RASSF1; RIZ1; SPRY2; Src; TFRC; 
TP53BP1;UPA; VEGFC; or (b) an array comprising polynucleotides hybridizing to the 
25 following genes: CD44v3; CD44v6; DR5; GROl; KRT17; and LAMC2, immobilized on a solid 
surface; or (c) an array comprising polynucleotides hybridizing to the following genes: CD44s; 
CD82; CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; TIMP2; YB-1; A-Catenin; AKT1; 
AKT2; APC; Bax; B-Catenin; BTC; CCNA2; CCNE1; CCNE2; CD 105; CD44v3; CD44v6; 
CD68; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; DIABLO; DP YD; DR5; EDN1 
30 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; 
ITGA3; KRT17; LAMC2; MTA1; NMYC; PAH; PDGFA; PGK1; PTPD1; RANBP2; SPRY2; 
TP53BP1; and VEGFC, immobilized on a solid surface, or (d) an array comprising 
polynucleotides hybridizing to the following genes: Bak; Bclx; BRAF; BRK; Cadl7; CCND3; 
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CCNE1; CCNE2; CD105; CD9; COX2; DIABLO; ErbB3; EREG; FRP1; GPC3; GUS; HER2; 
HGF; ID1; ITGB3; PTPD1; RPLPO; STK15; SURV; TERC; TGFBR2; TITF1; XIAP; CA9; 
CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; DR5; GROl; KRT17; LAMC2; P14ARF; 
PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; TFRC; and UP A, immobilized on a solid 
5 surface. 

In a further aspect, the invention concerns a method in which RNA is isolated from a 
fixed, paraffin-embedded tissue specimen by a procedure comprising: 

(a) incubating a section of the fixed, paraffin-embedded tissue specimen at a 
temperature of about 56 °C to 70 °C in a lysis buffer, in the presence of a protease, without prior 
dewaxing, to form a lysis solution; 

(b) cooling the lysis solution to a temperature where the wax solidifies; and 

(c) isolating the nucleic acid from the lysis solution. 

In a different aspect, the invention concerns a kit comprising one or more of (1) 
10 extraction buffer/reagents and protocol; (2) reverse transcription buffer/reagents and protocol; 
and (3) qPCR buffer/reagents and protocol suitable for performing the gene expression analysis 
methods of the invention. 

In a further aspect, the invention concerns a method for measuring levels of mRNA 
products of genes listed in Tables 5 A and 5B by quantitative RT-PCR (qRT-PCR) reaction, by 
15 using an amplicon listed in Tables 5 A and 5B and a corresponding primer-probe set listed in 
• Tables 6A-6F. 

Brief Description of the Drawings 
Figure 1 is a chart illustrating the overall workflow of the process of the invention for 
measurement of gene expression. In the Figure, FPET stands for "fixed paraffin-embedded 
20 tissue," and "RT-PCR" stands for "reverse transcriptase PCR." RNA concentration is 
determined by using the commercial RiboGreen™ RNA Quantitation Reagent and Protocol. 

Figure 2 is a flow chart showing the steps of an RNA extraction method according to the 
invention alongside a flow chart of a representative commercial method. 

Detailed Description of the Preferred Embodiment 
25 A. Definitions 

Unless defined otherwise, technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Singleton et aL, Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley 
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& Sons (New York, NY 1994), and March, Advanced Organic Chemistry Reactions, 
Mechanisms and Structure 4th ed. ? John Wiley & Sons (New York, NY 1992), provide one 
skilled in the art with a general guide to many of the terms used in the present application. 

One skilled in the art will recognize many methods and materials similar or equivalent to 
5 those described herein, which could be used in the practice of the present invention. Indeed, the 
present invention is in no way limited to the methods and materials described. For purposes of 
the present invention, the following terms are defined below. 

The term "microarray" refers to an ordered arrangement of hybridizable array elements, 
preferably polynucleotide probes, on a substrate. 

10 The term "polynucleotide," when used in singular or plural, generally refers to any 

polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. Thus, for instance, polynucleotides as defined herein include, without 
limitation, single- and double-stranded DNA, DNA including single- and double-stranded 
regions, single- and double-stranded RNA, and RNA including single- and double-stranded 

15 regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or include single- and double-stranded regions. In addition, the term 
"polynucleotide" as used herein refers to triple-stranded regions comprising RNA or DNA or 
both RNA and DNA. The strands in such regions may be from the same molecule or from 
different molecules. The regions may include all of one or more of the molecules, but more 

20 typically involve only a region of some of the molecules. One of the molecules of a triple-helical 
region often is an oligonucleotide. The term "polynucleotide" specifically includes cDNAs. The 
term includes DNAs (including cDNAs) and RNAs that contain one or more modified bases. 
Thus, DNAs or RNAs with backbones modified for stability • or- for other reasons are 
"polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising 

25 unusual bases, such as inosine, or modified bases, such as tritiated bases, are included within the 
term "polynucleotides" as defined herein. In general, the term "polynucleotide" embraces all 
chemically, enzymatically and/or metabolically modified forms of unmodified polynucleotides, 
as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including 
simple and complex cells. 

30 The term "oligonucleotide" refers to a relatively short polynucleotide, including, without 

limitation, single-stranded deoxyribonucleotides, single- or double-stranded ribonucleotides, 
RNA:DNA hybrids and double-stranded DNAs. Oligonucleotides, such as single-stranded DNA 
probe oligonucleotides, are often synthesized by chemical methods, for example using automated 
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oligonucleotide synthesizers that are commercially available. However, oligonucleotides can be 
made by a variety of other methods, including in vitro recombinant DNA-mediated techniques 
and by expression of DNAs in cells and organisms. 

The terms "differentially expressed gene," "differential gene expression" and their 
5 synonyms, which are used interchangeably, refer to a gene whose expression is activated to a 
higher or lower level in a subject suffering from a disease, specifically cancer, such as breast 
cancer, relative to its expression in a normal or control subject. The terms also include genes 
whose expression is activated to a higher or lower level at different stages of the same disease. It 
is also understood that a differentially expressed gene may be either activated or inhibited at the 

10 nucleic acid level or protein level, or may be subject to alternative splicing to result in a different 
polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface 
expression, secretion or other partitioning of a polypeptide, for example. Differential gene 
expression may include a comparison of expression between two or more genes or their gene 
products, or a comparison of the ratios of the expression between two or more genes or their 

15 gene products, or even a comparison of two differently processed products of the same gene, 
which differ between normal subjects and subjects suffering from a disease, specifically cancer, 
or between various stages of the same disease. Differential expression includes both 
quantitative, as well as qualitative, differences in the temporal or cellular expression pattern in a 
gene or its expression products among, for example, normal and diseased cells, or among cells 

20 which have undergone different disease events or disease stages. For the purpose of this 
invention, "differential gene expression" is considered to be present when there is at least an 
about two-fold, preferably at least about four-fold, more preferably at least about six-fold, most 
preferably at least about ten-fold difference between the expression of a given gene in normal 
and diseased subjects, or in various stages of disease development in a diseased subject. 

25 The term "normalized" with regard to a gene transcript or a gene expression product 

refers to the level of the transcript or gene expression product relative to the mean levels of 
transcripts/products of a set of reference genes, wherein the reference genes are either selected 
based on their minimal variation across, patients, tissues or treatments ("housekeeping genes"), 
or the reference genes are the totality of tested genes. In the latter case, which is commonly 

30 referred to as "global normalization", it is important that the total number of tested genes be 
relatively large, preferably greater than 50. Specifically, the term 'normalized' with respect to 
an RNA transcript refers to the transcript level relative to the mean of transcript levels of a set of 
reference genes. More specifically, the mean level of an RNA transcript as measured by 
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TaqMan® RT-PCR refers to the Ct value minus the mean Ct values of a set of reference gene 
transcripts. 

The terms "expression threshold," and "defined expression threshold" are used 
interchangeably and refer to the level of a gene or gene product in question above which the gene 
5 or gene product serves as a predictive marker for patient response or resistance to a drug, in the 
present case an EGFR inhibitor drug. The threshold is defined experimentally from clinical 
studies such as those described in examples 1 and 2, below. The expression threshold can be 
selected either for maximum sensitivity (for example, to detect all responders to a drug), or for 
maximum selectivity (for example to detect only responders to a drug), or for minimum error. 

10 The phrase "gene amplification" refers to a process by which multiple copies of a gene or 

gene fragment are formed in a particular cell or cell line. The duplicated region (a stretch of 
amplified DNA) is often referred to as "amplicon." Usually, the amount of the messenger RNA 
(mRNA) produced, i.e., the level of gene expression, also increases in the proportion of the 
number of copies made of the particular gene expressed. 

15 The term "diagnosis" is used herein to refer to the identification of a molecular or 

pathological state, disease or condition, such as the identification of a molecular subtype of head 
and neck cancer, colon cancer, or other type of cancer. The term "prognosis" is used herein to 
refer to the prediction of the likelihood of cancer-attributable death or progression, including 
recurrence, metastatic spread, and drug resistance, of a neoplastic disease, such as breast cancer, 

20 or head and neck cancer. The term "prediction" is used herein to refer to the likelihood that a 
patient will respond either favorably or unfavorably to a drug or set of drugs, and also the extent 
of those responses, or that a patient will survive, following surgical removal or the primary 
tumor and/or chemotherapy for a certain period of time without cancer recurrence. The 
predictive methods of the present invention can be used clinically to make treatment decisions by 

25 choosing the most appropriate treatment modalities for any particular patient. The predictive 
methods of the present invention are valuable tools in predicting if a patient is likely to respond 
favorably to a treatment regimen, such as surgical intervention, chemotherapy with a given drug 
or drug combination, and/or radiation therapy, or whether long-term survival of the patient, 
following surgery and/or termination of chemotherapy or other treatment modalities is likely. 

30 The term "long-term" survival is used herein to refer to survival for at least 5 years, more 

preferably for at least 8 years, most preferably for at least 10 years following surgery or other 
treatment. 
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The term "increased resistance" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol. 

The term "decreased sensitivity" to a particular drug or treatment option, when used in 
5 accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol, where decreased response can be compensated for (at least 
partially) by increasing the dose of drug, or the intensity of treatment/ 

"Patient response" can be assessed using any endpoint indicating a benefit to the patient, 
including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing 
10 down and complete growth arrest; (2) reduction in the number of tumor cells; (3) reduction in 
tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell 
infiltration into adjacent peripheral organs and/or tissues; (5) inhibition (i.e. reduction, slowing 
down or complete stopping) of metastasis; (6) enhancement of anti-tumor immune response, 
which may, but does not have to, result in the regression or rejection of the tumor; (7) relief, to 
15 some extent, of one or more symptoms associated with the tumor; (8) increase in the length of 
survival following treatment; and/or (9) decreased mortality at a given point of time following 
treatment. 

The term "treatment" refers to both therapeutic treatment and prophylactic or 
preventative measures, wherein the object is to prevent or slow down (lessen) the targeted 

20 pathologic condition or disorder. Those in need of treatment include those already with the 
disorder as well as those prone to have the disorder or those in whom the disorder is to be 
prevented. In tumor (e.g., cancer) treatment, a therapeutic agent may directly decrease the 
pathology of tumor cells, or render the tumor cells more susceptible to treatment by other 
therapeutic agents, e.g., radiation and/or chemotherapy. 

25 The term "tumor," as used herein, refers to all neoplastic cell growth and proliferation, 

whether malignant or benign, and all pre-cancerous and cancerous cells and tissues. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in 
mammals that is typically characterized by unregulated cell growth. Examples of cancer include 
but are not limited to, breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular 

30 cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder 
cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, head and 
neck cancer, and brain cancer. 
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The "pathology" of cancer includes all phenomena that compromise the well-being of the 
patient. This includes, without limitation, abnormal or uncontrollable cell growth, metastasis, 
interference with the nonnal functioning of neighboring cells, release of cytokines or other 
secretory products at abnormal levels, suppression or aggravation of inflammatory or 
5 immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or 
distant tissues or organs, such as lymph nodes, etc. 

The term "EGFR inhibitor" as used herein refers to a molecule having the ability to 
inhibit a biological function of a native epidermal growth factor receptor (EGFR). Accordingly, 
the term "inhibitor" is defined in the context of the biological role of EGFR. While preferred 
10 inhibitors herein specifically interact with (e.g. bind to) an EGFR, molecules that inhibit an 
EGFR biological activity by interacting with other members of the EGFR signal transduction 
pathway are also specifically included within this definition. A preferred EGFR biological 
activity inhibited by an EGFR inhibitor is associated with the development, growth, or spread of 
a tumor. 

15 The term "housekeeping gene" refers to a group of genes that codes for proteins whose 

activities are essential for the maintenance of cell function. These genes are typically similarly 
expressed in all cell types. Housekeeping genes include, without limitation, glyceraldehyde-3- 
phosphate dehydrogenase (GAPDH), Cypl, albumin, actins, e.g. p-actin, tubulins, cyclophilin, 
hypoxantine phsophoribosyltransferase (HRPT), L32. 28S, and 18S. . 

20 B. Detailed Description 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology (including recombinant techniques), microbiology, 
cell biology, and biochemistry, which are within the skill of the art. Such techniques are' 
explained fully in the literature, such as, "Molecular Cloning: A Laboratory Manual", 2 nd edition 

25 (Sambrook et al., 1989); "Oligonucleotide Synthesis" (MJ. Gait, ed., 1984); "Animal Cell 
Culture" (R.I. Freshney, ed., 1987); "Methods in Enzymology" (Academic Press, Inc.); 
"Handbook of Experimental Immunology", 4 th edition (D.M. Weir & C.C. Blackwell, eds., 
Blackwell Science Inc., 1987); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & 
M.P. Calos, eds. 5 1987); "Current Protocols in Molecular Biology" (F.M. Ausubel et al., eds., 

30 1987); and "PGR: The Polymerase Chain Reaction", (Mullis et al., eds., 1994). 
1. Gene Expression Profiling 

In general, methods of gene expression profiling can be divided into two large groups: 
methods based on hybridization analysis of polynucleotides, and methods based on sequencing 
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of polynucleotides. The most commonly used methods known in the art for the quantification of 
mRNA expression in a sample include northern blotting and in situ hybridization (Parker & 
Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse protection assays (Hod, 
Biotechniques 13:852-854 (1992)); and reverse transcription polymerase chain reaction (RT- 
PCR) (Weis et al, Trends in Genetics 8:263-264 (1992)). Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and 
DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for sequencing- 
based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene 
expression analysis by massively parallel signature sequencing (MPSS). 

2. Reverse Transcriptase PCR (RT-PCR) 

Of the techniques listed above, the most sensitive and most flexible quantitative method 
is RT-PCR, which can be used to compare mRNA levels in different sample populations, in 
normal and tumor tissues, with or without drug treatment, to characterize patterns of gene 
expression, to discriminate between closely related mRNAs, and to analyze RNA structure. 

The first step is the isolation of mRNA from a target sample. The starting material is 
typically total RNA isolated from human tumors or tumor cell lines, and corresponding normal 
tissues or cell lines, respectively. Thus RNA can be isolated from a variety of primary tumors, 
including breast, lung, colon, prostate, brain, liver, kidney, pancreas, spleen, thymus, testis, 
ovary, uterus, head and neck, etc., tumor, or tumor cell lines, with pooled DNA from healthy 
donors. If the source of mRNA is a primary tumor, mRNA can be extracted, for example, from 
frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples. 

General methods for mRNA extraction are well known in the art and are disclosed in 
standard textbooks of molecular biology, including Ausubel et al, Current Protocols of 
Molecular Biology, John Wiley and Sons (1997). Methods for RNA extraction from paraffin 
embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67 (1987), 
and De Andres et al, BioTechniques 18:42044 (1995). In particular, RNA isolation can be 
performed using purification kit, buffer set and protease from commercial manufacturers, such as 
Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in 
culture can be isolated using Qiagen RNeasy mini-columns. Other commercially available RNA 
isolation kits include MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®, 
Madison, WI), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue 
samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumor can be 
isolated, for example, by cesium chloride density gradient centrifugation. 
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As RNA cannot serve as a template for PCR, the first step in gene expression profiling by 
RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its 
exponential amplification in a PCR reaction. The two most commonly used reverse 
transcriptases are aviio myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney 
5 murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is 
typically primed using specific primers, random hexamers, or oligo-dT primers, depending on 
the circumstances and the goal of expression profiling. For example, extracted RNA can be 
reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, CA, USA), following the 
manufacturer's instructions. The derived cDNA can then be used as a template in the subsequent 
10 PCR reaction. 

Although the PCR step can use a variety of thermostable DNA-dependent DNA 
polymerases, it typically employs the Taq DNA polymerase, which has a 5' -3' nuclease activity 
but lacks a 3' -5 5 proofreading endonuclease activity. Thus, TaqMan® PCR typically utilizes the 
5' -nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its 

15 target amplicon, but any enzyme with equivalent 5' nuclease activity can be used. Two 
oligonucleotide primers are used to generate an amplicon typical of a PCR reaction. A third 
oligonucleotide, or probe, is designed to detect nucleotide sequence located between the two 
PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with 
a reporter fluorescent dye and a quencher fluorescent dye! Any laser-induced emission from the 

20 reporter dye is quenched by the quenching dye when the two dyes are located close together as 
they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme 
cleaves the probe in a template-dependent manner. The resultant probe fragments disassociate in 
solution, and signal from the released reporter dye is free from the quenching effect of the 
second fluorophore. One molecule of reporter dye is liberated for each new molecule 

25 synthesized, and detection of the unquenched reporter dye provides the basis for quantitative 
interpretation of the data. 

TaqMan® RT-PCR can be performed using commercially available equipment, such as, 
for example, ABI PRISM 7700™ Sequence Detection System™ (Perkin-Elmer-Applied 
Biosystems, Foster City, CA, USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim, 

30 Germany). In a preferred embodiment, the 5 ! nuclease procedure is run on a real-time 
quantitative PCR device such as the ABI PRISM 7700™ Sequence Detection System™. The 
system consists of a thermocycler, laser, charge-coupled device (CCD), camera and computer. 
The system amplifies samples in a 96-well format on a thermocycler. During amplification, 
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laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 96 
wells, and detected at the CCD. The system includes software for running the. instrument and for 
analyzing the data. 

5-Nuclease assay data are initially expressed as Ct, or the threshold cycle. As discussed 
5 above, fluorescence values are recorded during every cycle and represent the amount of product 
amplified to that point in the amplification reaction. The point when the fluorescent signal is 
first recorded as statistically significant is the threshold cycle (C t ). 

To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually 
performed using an internal standard. The ideal internal standard is expressed at a constant level 

10 among different tissues, and is unaffected by the experimental treatment. RNAs most frequently 
used to normalize patterns of gene expression are mRNAs for the housekeeping genes 
glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and P-actin. 

A more recent variation of the RT-PCR technique is the real time quantitative PCR, 
which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., 

15 TaqMan® probe). Real time PCR is compatible both with quantitative competitive PCR, where 
internal competitor for each target sequence is used for normalization, and with quantitative 
comparative PCR using a normalization gene contained within the sample, or a housekeeping 
gene for RT-PCR. For further details see, e.g. Held et al, Genome Research 6:986-994 (1996). 

According to one aspect of the present invention, PCR primers and probes are designed 

20 based upon intron sequences present in the gene to be amplified. In this embodiment, the first 
step in the primer/probe design is the delineation of intron sequences within the genes. This can 
be done by publicly available software, such as the DNA BLAT software developed by Kent, 
W.J., Genome Res. 12(4):656-64 (2002), or by the BLAST software including its variations. 
Subsequent steps follow well established methods of PCR primer and probe design. 

25 In order to avoid non-specific signals, it is important to mask repetitive sequences within 

the introns when designing the primers and probes. This can be easily accomplished by using 
the Repeat Masker program available on-line through the Baylor College of Medicine, which 
screens DNA sequences against a library of repetitive elements and returns a query sequence in 
which the repetitive elements are masked. The masked intron sequences can then be used to 

30 design primer and probe sequences using any commercially or otherwise publicly available 
primer/probe design packages, such as Primer Express (Applied Biosystems); MGB assay-by- 
design (Applied Biosystems); PrimerS (Steve Rozen and Helen J. Skaletsky (2000) Primer3 on 
the WWW for general users and for biologist programmers. In: Krawetz S, Misener S (eds) 
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Bioinformatics Methods and Protocols: Methods in Molecular Biology. Humana Press, Totowa, 
NJ,pp 365-386) 

The most important factors considered in PCR primer design include primer length, 
melting temperature (Tm), and G/C content, specificity, complementary primer sequences, and 
5 3'-end sequence. In general, optimal PCR primers are generally 17-30 bases in length, and 
contain about 20-80%,. such as, for example, about 50-60% G+C bases. Tm's between 50 and 80 
°C, e.g. about 50 to 70 °C are typically preferred. 

For further guidelines for PCR primer and probe design see, e.g. Dieffenbach, C.W. et 
al, "General Concepts for PCR Primer Design" in: PCR Primer, A Laboratoiy Manual, Cold 
10 Spring Harbor Laboratory Press, New York, 1995, pp. 133-155; Innis and Gelfand, 
"Optimization of PCRs" in: PCR Protocols, A Guide to Methods and Applications, CRC Press, 
London, 1994, pp. 5-11; and Plasterer, T.N. Primerselect: Primer and probe design. Methods 
Mol Biol 70:520-527 (1997), the entire disclosures of which are hereby expressly incorporated 
by reference. 
15 3. Microarravs 

Differential gene -expression can also be identified, or confirmed using the microarray 
technique. Thus, the expression profile of breast cancer-associated genes can be measured in 
either fresh or paraffin-embedded tumor tissue, using microarray technology. In this method, 
polynucleotide sequences of interest (including cDNAs and oligonucleotides) are plated, or 
20 arrayed, on a microchip substrate. The arrayed sequences are then hybridized with specific DNA 
probes from cells or tissues of interest. Just as in the RT-PCR method, the source of mRNA 
typically is total RNA isolated from human tumors or tumor cell lines, and corresponding normal 
tissues or cell lines. Thus RNA can be isolated from a variety of primary tumors or tumor cell 
lines. If the source of mRNA is a primary tumor, mRNA can be extracted, for example, from 
25 frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples, which are 
routinely prepared and preserved in everyday clinical practice. 

In a specific embodiment of the microarray technique, PCR amplified inserts of cDNA 
clones are applied to a substrate in a dense array. Preferably at least 10,000 nucleotide sequences 
are applied to the substrate. The microarrayed genes, immobilized on the microchip at 10,000 
30 elements each, are suitable for hybridization under stringent conditions. Fluorescently labeled 
cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse 
transcription of RNA extracted from tissues of interest. Labeled cDNA probes applied to the 
chip hybridize with specificity to each spot of DNA on the array. After stringent washing to 
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remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by 
another detection method, such as a CCD camera. Quantitation of hybridization of each arrayed 
element allows for assessment of corresponding mRNA abundance. With dual color 
fluorescence, separately labeled cDNA probes generated from two sources of RNA are 
5 hybridized pairwise to the array. The relative abundance of the transcripts from the two sources 
corresponding to each specified gene is thus determined simultaneously. The miniaturized scale 
of the hybridization affords a convenient and rapid evaluation of the expression pattern for large 
numbers of genes. Such methods have been shown to have the sensitivity required to detect rare 
transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least 

10 approximately two-fold differences in the expression levels (Schena et at, Proc. Natl Acad. Scl 
USA 93(2):106-149 (1996)). Microarray analysis can be performed by commercially available 
equipment, following manufacturer's protocols, such as by using the Affymetrix GenChip 
technology, or Incyte's microarray technology. 

The development of microarray methods for large-scale analysis of gene expression 

15 makes it possible to search systematically for molecular markers of cancer classification and 
outcome prediction in a variety of tumor types. 

4. Serial Analysis of Gene Expression (SAGE) 

Serial analysis of gene expression (SAGE) is a method that allows the simultaneous and 
quantitative analysis of a large number of gene transcripts, without the need of providing an 

20 individual hybridization probe for each transcript. First, a short sequence tag (about 10-14 bp) is 
generated that contains sufficient information to uniquely identify a transcript, provided that the 
tag is obtained from a unique position within each transcript. Then, many transcripts, are linked 
together to form long serial molecules, that can be sequenced, revealing the identity of the 
multiple tags simultaneously. The expression pattern of any population of transcripts can be 

25 quantitatively evaluated by determining the abundance of individual tags, and identifying the 
gene corresponding to each tag. For more details see, e.g. Velculescu et al, Science 270:484- 
487 (1995); and Velculescu et al, Cell 88:243-51 (1997). 

5. MassARRA Y Technology 

The MassARRA Y (Sequenom, San Diego, California) technology is an automated, high- 
30 throughput method of gene expression analysis using mass spectrometry (MS) for detection. 
According to this method, following the isolation of RNA, reverse transcription and PCR 
amplification, the cDNAs are subjected to primer extension. The cDNA-derived primer 
extension products are purified, and dipensed on a chip array that is pre-loaded with the 
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components needed for MALTI-TOF MS sample preparation. The various cDNAs present in the 
reaction are quantitated by analyzing" the peak areas in the mass spectrum obtained. 

6. Gene Expression Analysis by Massively Parallel Signature Sequencing (MPSS 
This method, described by Brenner et al, Nature Biotechnology 18:630-634 (2000), is a 

5 sequencing approach that combines non-gel-based signature sequencing with in vitro cloning of 
millions of templates on separate 5 jixm diameter microbeads. First, a microbead library of DNA 
templates is constructed by in vitro cloning. This is followed by the assembly of a planar array 
of the template-containing microbeads in a flow cell at a high density (typically greater than 3 x 
10 6 microbeads/cm 2 ). The free, ends of the cloned templates on each microbead are analyzed 
10 simultaneously, using a fluorescence-based signature sequencing method that does not require 
DNA fragment separation. This method has been shown to simultaneously and accurately 
provide, in a single operation, hundreds of thousands of gene signature sequences from a yeast 
cDNA library. 

7. Immunohistochemistry 

15 Immxuiohistochemistry methods are also suitable for detecting the expression levels of 

the prognostic markers of the present invention. Thus, antibodies or antisera, preferably 
polyclonal antisera, and most preferably monoclonal antibodies specific for each marker are used 
to detect expression. The antibodies can be. detected by direct labeling of the antibodies 
themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as, biotin, 

20 or an enzyme such as horse radish peroxidase or alkaline phosphatase. Alternatively, unlabeled 
primary antibody is used in conjunction with a labeled secondary antibody, comprising antisera, 
polyclonal antisera or a monoclonal antibody specific for the primary antibody. 
Immunohistochemistry protocols and kits are well known in the art and are commercially 
available. 

25 8. Proteomics 

The term "proteome" is defined as the totality of the proteins present in a sample (e.g. 
tissue, organism, or cell culture) at a certain point of time. Proteomics includes, among other 
things, study of the global changes of protein expression in a sample (also referred to as 
"expression proteomics"). Proteomics typically includes the following steps: (1) separation of 

30 individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of tire 
individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal sequencing, 
and (3) analysis of the data using bioinformatics. Proteomics methods are valuable supplements 
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to other methods of gene expression profiling, and can be used, alone or in combination with 
other methods, to detect the products of the prognostic markers of the present invention. 

9. Improved Method for Isolation of Nucleic Acid front Archived Tissue 
5 Specimens 

In the first step of the method of the invention, total RNA is extracted from the source 
material of interest, including fixed, paraffin-embedded tissue specimens, and purified 
sufficiently to act as a substrate in an enzyme assay. While extration of total RNA can be 
performed by any method known in the art, in a particular embodiment, the invention relies on 

10 an improved method for the isolation of nucleic acid from archived, e.g. fixed, paraffin- 
embedded tissue specimens (FPET). 

Measured levels of mRNA species are useful for defining the physiological or 
pathological status of cells and tissues. RT-PCR (which is discussed above) is one of the most 
sensitive, reproducible and quantitative methods for this "gene expression profiling". Paraffm- 

15 embedded, formalin-fixed tissue is the most widely available material for such studies. Several 
laboratories have demonstrated that it is possible to successfully use fixed-paraffin-embedded 
tissue (FPET) as a source of RNA for RT-PCR (Stanta et al, Biotechniques 11:304-308 (1991); 
Stanta et al, Methods Mol Biol 86:23-26 (1998); Jackson et al, Lancet 1:1391 (1989); Jackson 
et al, J. Clin. Pathol 43:499-504 (1999); Finke et al, Biotechniques 14:448-453 (1993); 

20 Goldsworthy et al, Mol Carcinog. 25:86-91 (1999); Stanta and Bonin, Biotechniques 24:271- 
276 (1998); Godfrey et al, J. Mol Diagnostics 2:84 (2000); Specht et al, J. Mol Med. 78:B27 
(2000); Specht et al, Am. J. Pathol. 158:419-429 (2001)). This allows gene expression profiling 
to be carried out on the most commonly available source of human biopsy specimens, and 
therefore potentially to create new valuable diagnostic and therapeutic information. 

25 The most widely used protocols utilize hazardous organic solvents, such as xylene, or 

octane (Finke et al, supra) to dewax the tissue in the paraffin blocks before nucleic acid (RNA 
and/or DNA) extraction. Obligatory organic solvent removal (e.g. with ethanol) and rehydration 
steps follow, which necessitate multiple manipulations, and addition of substantial total time to 
the protocol, which can take up to several days. Commercial kits and protocols for RNA 

30 extraction from FPET [MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®, 
Madison, WI); Paraffin Block RNA Isolation Kit (Ambion, Inc.) and RNeasy™ Mini kit 
(Qiagen, Chatsworth, CA)] use xylene for deparaffinization, in procedures which typically 
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require multiple centrifugations and ethanol buffer changes, and incubations following 
incubation with xylene. 

The method that can be used in the present invention provides an improved nucleic acid 
extraction protocol that produces nucleic acid, in particular RNA, sufficiently intact for gene 
5 expression measurements. The key step in this improved nucleic acid extraction protocol is the 
performance of dewaxing without the use of any organic solvent, thereby eliminating the need 
for multiple manipulations associated with the removal of the organic solvent, and substantially 
reducing the total time to the protocol. According to the improved method, wax, e.g. paraffin is 
removed from wax-embedded tissue samples by incubation at 65-75 °C in a lysis buffer that 

10 solubilizes the tissue and hydrolyzes the protein, following by cooling to solidify the wax. 

Figure 2 shows a flow chart of the improved RNA extraction protocol used herein in 
comparison with a representative commercial method, using xylene to remove wax. The times 
required for individual steps in the processes and for the overall processes are shown in the chart. 
As shown, the commercial process requires approximately 50% more time than the improved 

15 process used in performing the methods of the invention. 

The lysis buffer can be any buffer known for cell lysis. It is, however, preferred that 
oligo-dT-based methods of selectively purifying polyadenylated mRNA not be used to isolate 
RNA for the present invention, since the bulk of the mRNA molecules axe expected to be 
fragmented and therefore will not have an intact polyadenylated tail, and will not be recovered or 

20 available for subsequent analytical assays.. Otherwise, any number of standard nucleic acid 
purification schemes can be used. These include chaotrope and organic solvent extractions, 
extraction using glass beads or filters, salting out and precipitation based methods, or any of the 
purification methods known in the art to recover total RNA or total nucleic acids from a 
biological source. 

25 Lysis buffers are commercially available, such as, for example, from Qiagen, Epicentre, 

or Ambion. A preferred group of lysis buffers typically contains urea, and Proteinase K or other 
protease. Proteinase K is very useful in the isolation of high quality, undamaged DNA or RNA, 
since most mammalian DNases and RNases are rapidly inactivated by this enzyme, especially in 
the presence of 0.5 - 1% sodium dodecyl sulfate (SDS). This is particularly important in the 

30 case of RNA, which is more susceptible to degradation than DNA. While DNases require metal 
ions for activity, and can therefore be easily inactivated by chelating agents, such as EDTA, there 
is no similar co-factor requirement for RNases. 
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Cooling and resultant solidification of the wax permits easy separation of the wax from - 
the total nucleic acid, which can be conveniently precipitated, e.g. by isopropanol. Further 
processing depends on the intended purpose. If the proposed method of RNA analysis is subject 
to bias by contaminating DNA in an extract, the RNA extract can be further treated, e.g. by 
DNase, post purification to specifically remove DNA while preserving RNA. For example, if 
the goal is to isolate high quality RNA for subsequent RT-PCR amplification, nucleic acid 
precipitation is followed by the removal of DNA, usually by DNase treatment. However, DNA 
can be removed at various stages of nucleic acid isolation, by DNase or other techniques well 
known in the art. 

While the advantages of the improved nucleic acid extraction discussed above are most 
apparent for the isolation of RNA from archived, paraffin embedded tissue samples, the wax 
removal step of the present invention, which does not involve the use of an organic solvent, can 
also be included in any conventional protocol for the extraction of total nucleic acid (RNA and 
DNA) or DNA only. 

By using heat followed by cooling to remove paraffin, the improved process saves 
valuable processing time, and eliminates a series of manipulations, thereby potentially increasing 
the yield of nucleic acid. 

10. S'-multivlexed Gene Specific Primimz of Reverse Transcrivtion 

RT-PCR requires reverse transcription of the test RNA population as a first step. The 
most commonly used primer for reverse transcription is oligo-dT, which works well when RNA 
is intact. However, this primer will not be effective when RNA is highly fragmented as is the 
case in FPE tissues. 

The present invention includes the use of gene specific primers, which are roughly 20 
bases in length with a Tm optimum between about 58 °C and 60 °C. These primers will also 
serve as the reverse primers that drive PGR DNA amplification. 

An alternative approach is based on the use of random hexamers as primers for cDNA 
synthesis. However, we have experimentally demonstrated that the method of using a 
multiplicity of gene-specific primers is superior over the known approach using random 
hexamers. 

11. Normalization Sfratezv 

An important aspect of the present invention is to use the measured expression of certain 
genes by EGFR-expressing cancer tissue to provide information about the patient's likely 
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response to treatment with an EGFR-inhibitor. For this purpose it is necessary to correct for 
(normalize away) both differences in the amount Of RNA assayed and variability in the quality of 
the RNA used. Therefore, the assay typically measures and incorporates the expression of 
certain normalizing genes, including well known housekeeping genes, such as GAPDH and 
5 Cypl. Alternatively or in adddition, normalization can be based on the mean or median signal 
(Ct in the case of RT-PCR) of all of the assayed genes or a large subset thereof (global 
normalization approach). On a gene-by-gene basis, measured normalized amount of a patient 
tumor mRNA is compared to the amount found in a reference set of cancer tissue of the same 
type (e.g. head and neck cancer, colon cancer, etc.). The number (N) of cancer tissues in this 

10 reference set should be sufficiently high to ensure that different reference sets (as a whole) 
behave essentially the same way. If this condition is met, the identity of the individual cancer 
tissues present in a particular set will have no significant impact on the relative amounts of the 
genes assayed. Usually, the cancer tissue reference set consists of at least about 30, preferably at 
least about 40 different FPE cancer tissue specimens. Unless noted otherwise, normalized 

15 expression levels for each mRNA/tested tumor/patient will be expressed as a percentage of the 
expression level measured in the reference set. More specifically, the reference set of a 
sufficiently high number (e.g. 40) of tumors yields a distribution of normalized levels of each 
mRNA species. The level measured in a particular tumor sample to be analyzed falls at some 
percentile within this range, which can be determined by methods well known in the art. Below, 

20 unless noted otherwise, reference to expression levels of a gene assume normalized expression 
relative to the reference set although this is not always explicitly stated. 

12. EGFR Inhibitors 

The epidermal growth factor receptor (EGFR) family (which includes EGFR, erb-B2, 
erb-B3, and erb-B4) is a family of growth factor receptors that are frequently activated in 

25 epithelial malignancies. Thus, the epidermal growth factor receptor (EGFR) is known to be 
active in several tumor types, including, for example, ovarian cancer, pancreatic cancer, non- 
small cell lung cancer, breast cancer, colon cancer and head and neck cancer. Several EGFR 
inhibitors, such as ZD1839 (also known as gefitinib or Iressa); and OSI774 (Erlotinib, 
Tarceva™), are promising drug candidates for the treatment of EGFR-expressing cancer. 

30 Iressa, a small synthetic quinazoline, competitively inhibits the ATP binding site of 

EGFR, a growth-promoting receptor tyrosine kinase, and has been in Phase III clinical trials for 
the treatment of non-small-cell lung carcinoma. Another EGFR inhibitor, [agr]cyano- 
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[bgr]methyl-iy"-[(trifluoromethoxy)phenyl]-propenam (LFM-A12), has been shown to inhibit 
the proliferation and invasiveness of EGFR positive human breast cancer cells. 

Cetuximab is a monoclonal antibody that blocks the EGFR and EGFR-dependent cell 
growth. It is currently being tested in phase III clinical trials. 
5 Tarceva™ has shown promising indications of anti-cancer activity in patients with 

advanced ovarian cancer, and non-small cell lung and head and neck carcinomas. 

The present invention provides valuable tools to predict whether an EGFR-positive tumor 
is likely to respond to treatment with an EGFR-inhibitor. 

Recent publications further confirm the involvement of EGFR in gastrointestinal (e.g. 
10 colon) cancer, and associate its expression with poor survival. See, e.g. Khorana et aL, Proc. 
Am. Soc. Clin, Oncol 22:317 (2003). 

While the listed examples of EGFR inhibitors a small organic molecules, the findings of 
the present invention are equally applicable to other EGFR inhibitors, including, without 
limitation, anti-EGFR antibodies, antisense molecules, small peptides, etc. 
15 Further details of the invention will be apparent from the following non-limiting 

Examples. 

Example 1 

A Phase II Study of Gene Expression in Head and Neck Tumors 

A gene expression study was designed and conducted with the primary goal to 
20 molecularly characterize gene expression in paraffin-embedded, fixed tissue samples of head and 
neck cancer patients who responded or did not respond to treatment with an EGFR inhibitor. 
The results are based on the use of five different EGFR inhibitor drugs. 

Study design 

Molecular assays were performed on paraffin-embedded, formalin-fixed head and neck 
25 tumor tissues obtained from 14 individual patients diagnosed with head and neck cancer. 
Patients were included in the study only if histopathologic assessment, performed as described in 
the Materials and Methods section, indicated adequate amounts of tumor tissue. 
Materials and Methods 

Each representative tumor block was characterized by standard histopathology for 
30 diagnosis, semi-quantitative assessment of amount of tumor, and tumor grade. A total of 6 
sections (10 microns in thickness each) were prepared and placed in two Costar Brand 
Microcentrifuge Tubes (Polypropylene, 1.7 mL tubes, clear; 3 sections in each tube). If the 
tumor constituted less than 30% of the total specimen area, the sample may have been crudely 
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dissected by the pathologist, using gross microdissection, putting the tumor tissue directly into 
the Costar tube. 

If more than one tumor block was obtained as part of the surgical procedure, all tumor 
blocks were subjected to the same characterization, as described above, and the block most 
representative of the pathology was used for analysis. 
Gene Expression Analysis 

mRNA was extracted and purified from fixed, paraffin-embedded tissue samples, and 
prepared for gene expression analysis as described above. 

Molecular assays of quantitative gene expression were performed by RT-PCR, using the 
ABI PRISM 7900™ Sequence Detection System™ (Perkin-Elmer-Applied Biosystems, Foster 
City, CA, USA). ABI PRISM 7900™ consists of a thermocycler, laser, charge-coupled device 
(CCD), camera and computer. The system amplifies samples in a 384-well format on a 
thermocycler. During amplification, laser-induced fluorescent signal is collected hi real-time 
through fiber optics cables for all 384 wells, and detected at the CCD. The system includes 
software for running the instrument and for analyzing the data. 
Analysis and Results 

Tumor tissue was analyzed for 185 cancer-related genes and 7 reference genes. The 
threshold cycle (CT) values for each patient were normalized based on the mean of all genes for 
that particular patient. Clinical outcome data were available for all patients. 

Outcomes were classified as either response or no response. The results were analyzed in 
two different ways using two different criteria for response: partial response, or clinical benefit. 
The latter criterion combines partial or complete response with stable disease (minimum 3 
months). In this study, there were no complete responses, four cases of partial response and two 
cases of disease stabilization. 

We evaluated the relationship between gene expression and partial response by logistic 
regression and have identified the following genes as significant (p<0. 15), as indicated in the 
attached Table 1. The logistic model provides a means of predicting the probability (Pr) of a 
subject as being either a partial responder or not. The following equation defined the expression 
threshold for response. 

Pr (Response) = ^^^J^^aci and Pr ( No Response) = 1 - Pr (Response) 

In Table 1, the term "negative" indicates that greater expression of the gene decreased 
likelihood of response to treatment with EGFR inhibitor, and "positive" indicates that increased 
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expression of the gene increased likelihood of response to EGFR inhibitor. Results from 
analysis of head and neck cancer patient data using clinical benefit criteria are shown in Table 2. 

Overall increased expression of the following genes correlated with resistance of head 
and neck cancer to EGFR inhibitor treatment: A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; 
5 BTC; CCNA2; CCNE1; CCNE2; CD105; CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; 
COX2; cripto; DCR3; DIABLO; DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; 
ERK1; fas; FRP1; GROl; HB-EGF; HER2;TGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; 
NMYC; PAI1; PDGFA; PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; and VEGFC; and 
increased expression of the following genes correlated with response of head and neck cancer to 
10 EGFR inhibitor treatment: CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; 
TIMP2; and YB-1 . 

Example 2 

A Phase II Study of Gene Expression in Colon Cancer 
15 In a study analogous to the study of head and neck cancer patients described in Example 

1, gene expression markers were sought that correlate with increased or decreased likelihood of 
colon cancer response to EGFR inhibitors. Sample preparation and handling and gene 
expression and data analysis were performed as in Example 1. 

Twenty-three colon adenocarcinoma patients in all were studied, using a 192 gene assay. 
20 188 of the. 192 genes were expressed above the limit of detection. Both pathological and clinical 
responses were evaluated. Following treatment with EGFR inhibitor, three patients were 
determined to have had a partial response, five to have stable disease and fifteen to have 
progressive disease. 

Table 3 shows the results obtained using the partial response criterion. 
25 Results from analysis of colon cancer patient data using clinical benefit criteria are shown 

in Table 4. 

Overall, increased expression of the following genes correlated with resistance of colon 
cancer to EGFR inhibitor treatment: CA9; CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; 
DR5; GROl; KRT17; LAMC2; P14ARF; PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; 
30 TFRC; and UP A, and increased expression of the following genes correlated with sensitivity of 
colon cancer to EGFR inhibitor treatment: CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; 
p27; P53; RBI; TIMP2; and YB-1. 
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Finally, it is noteworthy that increased expression of the following genes correlated with 
resistance to EGFR inhibitor treatment in both head and neck and colon cancer: CD44v3; 
CD44v6; DR5; GROl; KRT17; LAMC2. 

In similar experiments, the elevated expression of LAMC2, B-Catenin, Bax, GROl, Fas, 
5 or ITGA3 in EGFR-positive head and neck cancer was determined to be an indication that the 
patient is not likely to respond well to treatment with an EGFR inhibitor. On the other hand, 
elevated expression of YB-1, PTEN, CTSL, P53, STAT3, ITGB3, IGFBP3, RPLPO or p27 in 
EGFR-positive head and neck cancer was found to be an indication that the patient is likely to 
respond to EGFR inhibitor treatment. 
10 In another set of similar experiments, elevated expression of the following genes in 

EGFR-expressing colon cancer correlated with positive response to treatment: BAK; BCL2; 
BRAF; BRK; CCND3; CD9; ER2; ERBB4; EREG; ERK1; FRP1. Elevated expression of the 
following genes in EGFR-expressing colon cancer correlated with resistance to treatment: APN; 
CA9; CGND1; CDC25B; CD134; LAMC2; PDGFB; CD44v6; CYP1; DR5; GAPDH; IGFBP2; 
15 PLAUR; RASSF1 ; UPA. 

All references cited throughout the specification are hereby expressly incorporated by 
reference. 

Although the present invention is illustrated with reference to certain embodiments, it is 
not so limited. Modifications and variations are possible without diverting from the spirit of the 
20 invention. All such modifications and variations, which will be apparent to those skilled in the 
art, are specifically within the scope of the present invention. While the specific examples 
disclosed herein concern head and neck cancer and colon cancer, the methods of the present 
invention are generally applicable and can be extended to all EGFR-expressing cancers, and such 
general methods are specifically intended to be within the scope herein. 

25 
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Table 1 : Partial Response Genes for Head and Neck Study 



Gene Name 


Response 


Logistic Discriminat Function 


R2 


Likelihood 
Ratio Test 


Intercept 


Slope 


P Value 


cMet 


Negative 


26.5168713 


4.57143179 


0.6662 


0.0011 


LAMC2 


Negative 


5.29706425 


a no a OTinc 

1.28137295 


0.6155 


0.0017 


ITGA3 


Negative 


22.6008544 


3.17707499 


0.5063 


0.0044 


CD44v6 


Negative 


6.92255059 


4.3069909 


0.492 


0.005 


B-Catenin 


Negative 


7.85913706 


2.52965454 


0.4805 


0.0055 


PDGFA 


Negative 


o.OOIbooo 


1 .103oo4oo 


0.4318 


0.0085 


GRG1 


Negative 


8.37646635 


1.74815793 


0.4146 


0.0099 


ERK1 


Negative 


6.14712633 


1.64819007 


0.4024 


0.0111 


CD44v3 


Negative 


5.95094528 


0.000944/0 


0.3451 


0.0186 


Bax 


Negative 


5.34006632 


1 .19383253 


0.3361 


0.0202 


CGA 


Positive 


~70 <fOH 4 /ID 

-78.121 148 


-lU.OOoYo/ 


0.3266 


0:0221 


fas 


Negative 


7 07>l CM fH K 

( .2.1 4y 1 Ulo 


1 .004o40o0 


0.3251 


0.0224 


IGFBP3 


Positive 


-2.1529531 


-2.7937517 


0.3097 


0.0258 


MTA1 


Negative 


d C\~7 A C7077 

b.O/ 1b/Z/ / 


1.zoYoboY4 


0.3072 


0.0264 I 


YB-1 


Positive 


1 .73598983 


-4. 08591 /4 


0.2814 


0.0336 


DR5 


Negative 


y,uoouo4y 


i .^fooH-yy^-M- 


0.2703 


0.0373 


APC 


Negative 


5.775003 


1 .oooz42o9 


0.2512 


0.0447 


ERBB4 


Negative 


1 1 .9400200 


i .Ooouooy / 


0.2357 


0.0518 


CD68 


Negative 


0.0U0U04O/ 


1 .U04000 I 


0.2319 


0.0537 


cripto 


Negative 


a q cnn/i 070 
iy.0UU4o/ o 


z.o4yuyooo 


0.2251 


0.0574 


P53 


Positive 


-4. iy /O I Do 


- 1 .004 i i oy 


0.2208 


0.0598 


VEGFC 


Negative 


b.oobo44oy 


u.yuoio4/o 


. 0.2208 


0.0598 


A-Catenin 


Negative 


4.4 1 Z 1 0Z00 


•1 7^0-1 1 QA 

i . / oy i i y4 


0.2199 


0.0603 


COX2 


Negative 


o.uuyooyyo 


A 07^Q77^£i 

i .z / oy / / oo 


0.202 


0.0718 


CD82 


Positive 


-1 .8999985 


-1 OH H C7 

-1 .1 / 1 1 Of 


0.1946 


0.0772 


PAI1 


Negative 


O D/77700 A 

2.94777884 


U.9 /4oUoo4 


0.1944 


0.0774 


AKT2 


Negative 


2.45598587 


1.o4bUo1oy 


0.1889 


0.0817 


HER2 


Negative 


4.2505922o 


U.yY /4o4oo 


0.1845 


0.0853 


DIABLO 


Negative 


17.035069 


z.yoyoy /41 


0.1809 


0.0884 


p27 


Positive 


-1.9798519 


A r\f~\ A A A A O 

-1 .9041142 


0.1792 


0.09 


RANBP2 


Negative 


2.85994976 


0.41 o7oo6o 


0.1757 


0.0931 


E1F4E 


Negative 






0.1722 


0.0965 


EDN1 endothelin 


Negative 


6.0685891 1 


0.87185553 


0.1688 


0.0998 


IGF1R 


Negative 


6.14387144 


1.68865744 


0.1674 


0.1012 


AKT1 


Negative 


5.02676228 


1.50585593 


0.1659 


0.1028 


CCNA2 


Negative 


3.95684559 


0.63089954 


0.184 


0.1033 


HB-EGF 


Negative 


5.1019713 


0.70368632 


0.1627 


0.1061 


TIMP2 


Positive 


2.58975885 


-1.0832648 


0.1625 


0.1064 


EGFRd27 


Positive 


-38.789016 


-5.2513587 


0.1607 


0.1083 


Chk2 


Negative 


6.8797175 


1.21671205 


0.1581 


0.1112 
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Gene Name 


Response 


Logistic Discriminat Function 


R2 


Likelihood 
Ratio Test 


Intercept 


Slope 


P Value 


IRS1 


Negative 


12.0545078 


1.59632708 


0.1578 


0.1115 


FRP1 


Negative 


O.OOZOOOQZ 




0.1569 


0.1126 


CCNE2 


Negative 


5.78828731 


1.11609099 


0.1566 


0.1129 


SPRY2 


Negative 


4.68447069 


0.86747803 


0.1552 


0.1145 


KRT17 


Negative 


0.34280253 


0.412313 


0.151 


0.1195 


DPYD 


Negative 


2.78071456 


0.78918833 


0.1504 


0.1202 


CD105 


Negative 


3.13613733 


0.51406689 


0.1391 


0.1351 


TP53BP1 


Negative 


3.18676588 


0.58622276 


0.1361 


0.1395 


PTPD1 


Negative 


5.85217342 


1 .08545385 


0.1357 


0.1401 


CTSL 


Positive 


-2.2283797 


-1.4833372 


0.1354 


0.1405 
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Table 2: Clinical Benefit Genes for Head and Neck Study 



bene INHITie 


rv.es ponse 


Logistic Discriminat 
Function 


R 2 

rv 


1 ikplihonrl 

1 • IVCII 1 IUUU 

Ratio Test 


Intercept 


Slope 


P Value 


civiet.z 


Negative 


Zo.oooZoz 


4.4UoZo/ O 




U.VJUU / 


GR01.2 


Negative 


Ad A t~\~7A "7 
1U.10Y1 / 


z.4oyu4Uoo 


u.oooo 


a r\r\A 0 

u.uu 1 y 
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Negative 


o.loZyobol 


Z.bUoo4olZ 


u.oozo 


n ni n7 

U.U I u / 


A A Q 


I i^i 4*5 1 / /™\ 

iNegauve 


( . fboZbUb 


Z.ooUboUyZ 


a r>r\A.A 
U.OU44 


u.u i y*+ 


UOKo.o 


Negative 


10.2957141 


a ocah onnc 

1 .00012990 


u.zyo 


A r\OA Q 

u.uz i y 


D-L/atenin.o 


Negative 


4.zizoAz/y 


1 .041 / / OO 


A 9"7Q«1 

u.z/ y 1 


U.UZ.OZ 


bUNi enaotneiin.i 


Negative 


D.oo(JzZo14 


A a a ccnnco 
1 .14ooUUb2 


A 07C0 

U.Z/ 00 


A AO£1 

U.UZO I 




Negative 


/ .4o/ oi oyy 


1 .Z iZ/ U/Zo 


U.ZOO I 


U.Ui£Ov7 


LAMLrZ.Z 


Negative 


•1 70ft^QQRO 

i . / yooydoz 


u.ooozooyo 


A O/tQQ 

u.Z4yo 


U.U04Z 


OU44VO. 1 


Negative 


o ccncnc77 
z.ooUoUo/ ( 


1 .0/ ooolbZ 


A OH71 
U.ZU/ I 


u.uooy 


niARi c\ a 

UlADLvJ. 1 


iNegauve 


lO.OUO I o4 1 


z.yyy i uo i z 


U.ZUuu 


n 0^49 

U.VJOH'^ 


UU44 Vo.z 


Negative 


o.uz4yzo i y 


z.uo4oyo/ i 


a onno 
u.zuuz 


u.uoo 


NlvlYL/.Z 


iNegauve 


Zo.ZU 1 UoZ/ 


O.ZU /O / OUO 


u. 1 yoo 


U.UO I 


UUoZ.o 


positive 


O 7CO *1 Q*3T 

-z./oziyo/ 


-T .ibyzzbo 


U. I OO 


n DRfi9 

U.UOOii 


DA MDDO *3 
KANdKZ.o 


fv 1 ^v^N <^4"t\ 

iNegauve 


Z.UZU/D/ oo 


U.4Z 1 / 0Z00 


A A Qn*7 
U. I ou / 


n D71 ft 

U.U/ IO 


DD A A 


nositive 


-o. / oozyo4 


- I ./04UO0 1 


n 17^1 

U. I/O I 


n CV7RA 
u.u/ 


Lid DO Q 


Negative 


o.o/ 0O41 Do 


1 . 1 1 4oOU IO 


A OQO 
U.I / O^i 


n 077Q 

U.U / / %7 


1 VI 1 A i . J 


Negative 


o.yuzuzob 


u.yzzoob4o 


n ifi9ft 

U. IO^O 


U.VJO / *+ 


ObA.o 


rositive 


-4 i .yuyooy 


-O.OOoO 1 oZ 


n iaiq 

U. IO I 5? 


u.uooo 


OfcAUAMb.l 


Negative 


i .booyoyo / 


u.oyou/ /yz 


U. IOUZ 


u.uoyy 


r I ru I .Z 


Negative 


O.D IZ4Z/DO 


1 . I OO 1 DUDO 


U. IvJU I 


n HQ01 


lzr\l\ 1 .0 


Negative 


O A A AA~7C\CZ. 

Z.41 44 ( Uo 


U. / ZU/ Zoo4 


U. I Of 


a c\Qf\A 
u.uyo*t 


D3X. 1 


INegauve 


z.yi ooozoD 


U. /ooo40 iy 


n mo 

U. 1 \JiL 


n DQP.7 

U.U530 / 


O 1 IV! YO.O 


rosiuve 


-u.yy4o/ zo 


-u.ouOoyo 1 


U. I400 


U. I uzo 


pAVO i 

OvJAZ. 1 


Negative 


o. / yz / yo i d 


1 .uo i zu 1 0 


U. IH- / O 


n in^2i 

U. 1 UOn 


tzlr4L:. 1 


Negative 


z.uouuooy / 
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U. l*tOO 
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Negative 


Z.I / OZUZ40 


c\aa ROQ^no 
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u. I 0 / 


0.1169 


r(jj|\. i . 1 


Negative 


1 86416703 


1 92395917 


U. I OOO 


n 1919 


ALTO *2 

Ar\ 1 Z.o 


Negative 


1.45131206 


1.43341036 
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U. IzlO I 


A AOQA 

u. izyf 


BTC.3 


Negative 


12.1153734 


1.67411928 


0.1281 


0.1294 


APC.4 


Negative 


2.50791938 


0.92506412 


0.128 


0.1296 


CCNE2.2 . 


Negative 


3.98727145* 


0.89372321 


0.1267 


0.1315 


OPN, osteopontin.3 


Positive 


-0.522697 


-0.5069258 


0.1225 


0.1382 


ITGA3.2 


Negative 


2.23381763 


0.3800099 


0.1203 


0.1417 


KRT17.2 


Negative 


-0.4861169 


0.43917211 


0.1184 


0.1449 


CD44S.1 


Positive 


-0.9768133 


-0.8896223 


0.118 


0.1456 


EGFR.2 


Negative 


0.43258354 


0.46719029 


0.1162 


0.1487 ! 
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Table 3: Partial Response Genes for Colon Study 



Gene Name 


Response 


Logistic Discriminat 
Function 


R 2 


Likelihood 
Ratio Test 


Intercept 




P Value 


DCIX__Z 


r osilive 


2.04896151 


-2.1025144 


0.172 


0.0801 


DDAC O 


r OblllVc 


-2.5305788 


-3.0987684 


0.2532 


0.0337 


DDI/' O 


n UolUVfc? 


-2.6096501 


-1.577388 


0.2998 


0.0209 


pAQ O 


IMtJy ClU vc 


2.65287578 


0.83720397 


0.2758 


0.0267 


UdU I / I 


Pr^cif" 

ruoiuvc 


-0.0419396 


-1.8773242 


0.2096 


0.0533 


OOlNUo I 


rOSIUVc 


-1.014844 


A A A A A m T 

-5.1111617 


0.348 


0.0128 


PPMPi -1 
OL/lNtz. I__ 1 




-6.5821701 


-0.8939912 


0.1914 


0.0648 




D s-\ i 4- *r < //^ 

nOSlII Vc 


26.1675642 


-1 .0709109 


0.1707 


0.0812 


I UU I UO I 


P/""iciti\/o 
ruoiiivc 


5.85359096 


-1 .2349006 


0.1302 


0.1278 


OLJ 1 j4 Z 




-5.9286576 


a r~ a a a r\r~ A o 

1.51119518 


0.1212 


0.1418 


OU44Vo_Z 




-1.8184898 


1 .12771829 


0.2064 


0.0552 


UUUZPD_ I 


INcydLIVc . 


10.4351019 


1.59196005 


0.2455 


0.0365 


hdc; o 


lNC?y ciLIVc? 


-1.7399226 


1 .60177OOO 


0.1759 


0.0767 






3.65681435 


-0.76043O 


0.1222 


0.1401 




PnQitix/^ 
r uoiiivc 


-2.3409861 


-1 .121/olz 


0.2542 


. 0.0333 


OroO I 


ruoi live; 


4.03889935 


»1 AAA"7C^10 

-1 .9097648 


0.3752 


0.0097 


riDni 9 




2.77545378 


0.74734483 


0.124 


0.1359 


OUO 1 


PoQitix/f^ 

l UOllI vc 


8.29578416 


-1.9015759 


0.2105 


0.0529 


nor h 


rUolllVC 


5.10609383 


-1 .1947949 


0.2361 


0.0403 


IE-/ I I 


PoQitiv/^ 


10.6703203 


-1 ,65414b 


0.216 


0.0498 


l 1 bDO 1 


Pocith/^ 

I UolLlVC 


0.79232612 


-0.o27o0o 


0.3321 


0.015 




M<ari£3ti\/f=* 
iNcy ail vc 


5.93738146 


0.93514633 


0.2133 


0.0513 


i AMP9 9 


iMcyaii vc 


-0.3325052 


1.41o4zUo4 


0.2475 


0.0357 


r I ^Mrvi _ I 


Man Cjfi\/P» 

INcydlivc 


't.Ow'tJUvJv^O 


4 10859002 


0.2946 


0.022 


r L/ur D O 


iMcyauvc 


-4.7055966 


1.96517114 


0.3299 


0.0154 


PLAUR_3 


Negative 


7.51817646 


0.6862142 


0.1534 


0.0983 


PTPD1_2 


Positive 


-11.659761 


-1.2559081 


0.1247 


0.1362 


RASSF1_3 


Negative 


6.60631474 


0.9862129 


0.1708 


0.0811 


RIZ1_2 


Negative 


2.83817546 


0.86281199 


0.1255 


0.1349 


Src_2 


Negative 


4.91364145 


1.96089745 


0.1324 


0.1247 


! TFRC_3 


Negative 


-4.0754666 


3.03617052 


0.19 


0.0658 


I TITF1_1 


Positive 


-1.8849815 


-2.1890987 


0.1349 


0.1211 


upa_3 


Negative 


4.1059421 


1.14053848 


0.1491 


0.1032 


XIAP 1 


Positive 


-16.296951 


-2.9502191 


0.2661 


0.0295 
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Table 4: Clinical Benefit Genes for Colon Study 



fipne* Nam© 


Resoonse 


Logistic Discriminat 
Function 


R 2 


Likelihood 
Ratio Test 


Intercept 


Slope 


P Value 


Bak 


Positive 


-1.347937 


-0.993212 


C\ A A DO 


U.U0U4 


BRK 


Positive 


-3.237705 


-1.1479379 


U.zob/ 


U.UU5/ 


CD134 


Negative 


9.9358537 


1.68440149 


0.1 y27 


n n -i C7 
U.UIb/ 


CD44E 


Negative 


3.188991 


0.59091622 


u.uyoo 


u.uyi b 


CD44v6 


Negative 


5.7352464 


1.77571293 


U.ZDOO 


U.UU4/ 


CDC25B 


Negative 


2.0664209 


0.67140598 


U.U/oo 


u. I^/Z 


CGA 


Negative 


2.7903424 


0.43834476 


U.I Uoo 


u.u /y4 


COX2 


Positive 


-1.262804 


-0.4741852 


Pi A700 

U.U/oo 


U. 1 oy<5 


DIABLO 


Positive 


-2.514199 


-1.0753148 


U.I Uzo 


n none 
U.UoUO 


FRP1 


Positive 


-u.^i-u i yoo 


-u.oooooyy 


u.uyo/ 


u.uyoz 


GPC3 


Positive 


-7.875276 


-1 .7437079 


U.oUoo 


U.UU/dO 


HER2 


Positive 


0.1228609 


-0.5549133 


0.073 


0.1408 


ITGB3 


Positive 


-1.593092 


-0.5249778 


0.1352 


0.045 


PPARG 


Negative 


8.6479233 


1.36115361 


0.1049 


0.0774 


PTPD1 


Positive 


-3.203607 


-1 .2049773 


0.1356 


0.0447 


RPLPO 


Positive 


3.5110353 


-1.030518 


0.0752 


0.135 


STK15 


Positive 


-0.664989 


-0.5936475 


0.0873 


0.1072 


SURV 


Positive 


-1.409619 


-0.6214924 


0.074 


0.1381 


TERC 


Positive 


1.7755749 


-0.5180083 


0.1073 


0.0742 


TGFBR2 


Positive 


1.5172396 


-0.9288498 


0.0934 


0.0957 
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Tables 6A - 6F 



Gene 




M 

iNdfTlc 


t^CI^UCI IOC 


Length. Seq ID. 


A-Catenin 


imivi^uu i yuo 


^91 ^ft/A Psat<=» f9 
I OO/Z-v-wclle. IZ. 


C GTTCC G ATC CTCTATACTGC AT 


23 


94 


A-Catenin 


mm nn 1Q(T^ 

IMIVI UU I c?UO 


^91 ^Q/ A -Pate r9 


AGGTCCCTGTTGGCCTTATAGG 


22 


95 


A-oat6nin 


nm nmpn^ 


5^479 ^/A-Patp n9 


ATGCCTACAGCACCCTGATGTCGCA 


25 


96 


Alx 1 I 


IMIVI UUJ IDO 


OUU IU/ni\ 1 I.IO 


OGCTTCTATGGCGCTGAGAT 


20 


97 


A l/T-1 
AK. 1 1 


IMIVl__UUO IDO 


^n019/AKT1 
OUU I c-l r\r\ 1 I.IO 


TCO OG GTAC ACC AC GTTCTT 


20 


98 


A VST 4 

Ar\ 1 1 


MM 00^1 R*3 

IMIVI UUJ I DO 


R477R/AKT1 

/ / u/rArx I l . [JO 


CAGCCCTGGACTACCTGCACTCGG 


24 


99 


A l/TO 

Arv 1 £. 


MM fi01R9R 


^nft9ft/AKT9 

OUOtO/rVA 1 Z..IO 


TCnTGCOACCCTTCAAACC 


19 


100 


A l/TO 

Al\ 1 Z 


MM nniR9R 


^nft9Q/AKT9 r'} 

OUO/-J?/Ar\ 1 a.I w 


G G CG GTAAATTCATC ATCG AA 


21 


101 


A l/TO 

Ar\ 1 Z 


MM nmR9R 

IMIVI UU I ozo 


04707/Ai/TO n O 


CAGGTCACGTCCGAGGTCGACACA 


24 


102 


A 


mm nnnn^fc 

IM Iv^ UUUUOO 


f ^nn99/APP f4 


GGAnAGOAGGAATGTGTTTC 


20 


103 


ArU 


mm nnnn^A 

IM lvl__U uuuoo 


^009 4/ A PP r4 
OUU/H/nr U. 1*4 


AC P C A PTC G ATTTGTTTCTG 


20 


104 


a or* 
ArU 


mm nnnn^a 

IMIVt UUUUOO 


94ftftft/APP r>4 
o*+ooo/r\ru, p*t 


CATTGGCTCCCCGTGACCTGTA 


22 


105 


D-L»aLenin 


mm nniQnj. 
imivi^ uu iyu*T 


^91 ^O/R-P^tp f3 

OZ IUU/D "udlC.IO 


GGCTCTTGTGCGTACTGTCCTT 


22 


106 


b-oatenin 


mm nniQnj. 

IM IVI UU 1 v3U*r 


^91 51/R-Patp 

O/. ID I / D-\-/dlC. I O 


TPAGATGACGAAGAGCACAGATG 


23 


107 


b-L>aienin 


mm nniQOA 

IM IVI UU 1 oUt- 




AGGCTCAGTGATGTCTTCCCTGTCACCAG 


29 


108 


D3K 


mm nnnfta 

IMIVI UU 1 IOO 




CC ATTC G C AC C ATTCT AC CT 


20 


109 


iz- 
ba k 


mm nnuRR 

IMIV1__UU I IOO 


ouuoo/ ocir\. i 


GG GAAC AT AG ACCC ACC AAT 


20 


110 




mm nm 1 rr 

IMIVI UU I I OO 


S4794/Rak d9 


- ACACCCCAGACGTCCTGGCCT 


21 


111 


Bax 


mm 004^94 

■ IMIVI U UHOZ-t 


^nn^n/Ray fi 

OUUtU/DaA.! 1 


CCGCCGTGGACACAGACT 


18 


112 


Bax 


MM 004^94 

IMIVI UUHOZ't 


OUUHil/DaA.I 1 


TTGCCGTCAGAAAACATGTCA 


21 


113 


DdX 


MM 004^94 

IM IVI U U40/. t r 


O^vJJ? / / LJC3 A.. 1 


TGCCACTCGGAAAAAGACCTCTCGG 


25 


114 


QC1X 


MM PI011Q1 

IM tv 1. U U 1 1 v/ 1 


^0n4R/Rrlx f? 

OwutU/ LJUIA., 1 ^_ 


CTTTTGTGGAACTCTATGGGAACA 


24 


115 


bCIX 


imivi uu i i y i 


OUU*rO/ DUA, 1 ^_ 


C AGP GGTTG AAGCGTTCCT 


19 


116 


bCIX 


mm nnnQi 

IM IVI UU I I S7 1 




TTCGGCTCTCGGCTGCTGCA 


20 


117 


DKAr 


MM ODA^^ 

IM 1 V l__U Un O O O 


^^D97/RRAF f? 
ooul/ / pr\nr ,\c- 


CCTTCCGACCAGCAGATGAA 


20 


118 


bKAr 


mm nn4^^^ 

IMIVI UUH-OOO 


^^n9R/RRAF r9 


TTT ATATG C AC ATTG GG AG CTG AT 


24 


119 


DDAC 

bKAr 


MM OdA^^ 

IMIVI UUH-OOO 


^4R1R/RRAF n9 


OAATTTG GG CAAC GAG ACC G ATC CT 


25 4 


120 


bKrx 


MM 00^0,7^ 
IMIVI UUvJi/ / U 


ouu / u/ ur\i\. \£- 


GTGCAGGAAAGGTTCACAAA 


20 


121 


DDI/ 

Dr\r\ 


MM 00^07^ 

IMIV^ UUOC3 / Z> 


c *nR7Q/RRK r9 


GCACACACGATGGAGTAAGG 


20 


122 


DDI/ 

Drxrx 


MM nn^Q7 f: i 

IMIVI UUUC7 ) U 


S47fiQ/BRK d2 


AGTGTCTGCGTCCAATACACGCGT 


24 


123 


D 1 O 


MM HO 179.Q 

IMIVI U\J I / 


^1916/BTC f3 


AGGGAGATGCCGCTTCGT 


18 


124 


DTP 
D I L/ 


mm nrii79Q 

IMIVI UU I / £-<3 


O \ C- t f l L_> i w . I O 


CTCTCACACCTTGCTCCAATGTA 


23 


125 


DTP 

b 1 u 


MM HOI 79 Q 
In IVI UU I /Za 


<54ft44/RTr n 1 ^ 


C CTTC ATC AC AG AC AC AG G AGGG CG 


25 


126 


uAy 


mm nni9ifi 

iNlVI^JJU lc lu 




ATCCTAGCCCTGG 1 1 1 1 1 GG 


20 


127 


PAQ 

uAy 


MM n0191R 

IMIVI__UU JO 


O l \JZ/Zfi VAC 1 O 


CTGCCTTCTCATCTGCACAA 


20 


128 


pAQ 


mm nni9ifi 

IMIVI UU I ^ 1 vj 


S4938/CA9 d3 


TTTGCTGTCACCAGCGTCGC 


20 


129 


ParH7 


mm nn4nR^ 

IMIVI UUHULIJ 


O 1 UU/uaU li.lt 


GAAGGCCAAGAACCGAGTCA 


20 


130 


1/dU I / 


IMIVI UU4UUJ 


0£. 1 U f / \a/CIU 1 i.l I 


TC CC C AGTT AGTTC AAAAGTC AC A 


24 


131 


Cad17 


NMJD04063 


S5038/Cad17.p1 


TTATATTCCAGTTTAAGGCCAATCCTC 


27 


132 


CCNA2 


NM_001237 


S3039/CCNA2.f1 


CCATACCTCAAGTATTTGCCATCAG 


25 


133 


CCNA2 


NM„001237 


S3040/CCNA2.r1 


AG CTTTGTC CCGTG ACTGTGTA 


22 


134 


CCNA2 


NM_001237 


S4820/CCNA2.p1 


ATTG CTG GAG CTGCC 1 I 1 CATTTAGCACT 


29 


135 


CCND3 


NM_001760. 


S2799/CCND3.f1 


CCTCTGTGCTACAGATTATACCTTTGC 


27 


136 


CCND3 


NM_001760 


S2800/CCND3.r1 


CACTGCAGCCCCAATGCT 


18 


137 


CCND3 


NMJ)01760 


S4966/CCND3.p1 


TACCCGCCATCCATGATCGCCA 


22 


138 


CCNE1 


NMJD01238 


S1446/CCNE1.f1 


AAAGAAGATGATGACCGGGTTTAC 


24 


139 


CCNE1 


NM 001238 


S1447/CCNE1.r1 


GAGCCTCTGGATGGTGCAAT 


20 


140 
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CCNE1 


NM_001238 


S4944/CCNE1.p1 


CAAACTCAACGTGCAAGCCTCGGA 


24 


141 


CCNE2 


NM_057749 


S1458/CCNE2.f2 


ATGCTGTGGCTCCTTCCTAACT 


22 


142 


CCNE2 


NMJ357749 


S1459/CCNE2.r2 


ACCCAAATTGTGATATACAAAAAGGTT 


27 


143 


CCNE2 


NM_057749 


S4945/CCNE2.p2 


TAC C AAG C AAC CTAC ATGTC AAG AAAG CCC 


30 


144 


CD 105 


NMJD00118 


Sl410/CD105.f1 


GCAGGTGTCAGCAAGTATGATCAG 


24 


145 


CD105 


NMJ300118 


S1411/CD105.M 


TTTTTC CG CTGTGGTG ATG A 


20 


146 


CD105 


NM_000118 


S4940/CD105.p1 


CGACAGGATATTGACCACCGCCTCATT 


27 


147 


CD134 


NMJ)03327 


S3138/CD134.f2 


GCCCAGTGCGGAGAACAG 


18 


148 


CD134 


NM_003327 


S3139/CD134.r2 


AATCACACGCACCTGGAGAAC 


21 


149 


CD134 


NM_003327 


S3241/CD134.p2 


CCAGCTTGATTCTCGTCTCTGCACTTAAGC 


30 


150 


CD44E 


X55150 


S3267/CD44E.f1 


ATCACCGACAGCACAGACA 


19 


151 


CD44E 


X55150 


S3268/CD44E.r1 


ACCTGTGTTTGGATTTGCAG 


20 


152 


CD44E 


X55150 


S4767/CD44E.p1 


CCCTGCTACCAATATGGACTCCAGTCA 


27 


153 


CD44s 


M59040 


S3102/CD44s.f1 


GACGAAGACAGTCCCTGGAT 


20 


154 


CD44s 


M59040 


S3103/CD44s.r1 


ACTG GG GTG G AATGTGTCTT 


20 


155 


CD44S 


M59040 


S4826/CD44s.p1 


CACCG AC AGC AC AG AC AG AATC CC 


24 


156 


CD44V3 


AJ251595V3 


S2997/CD44v3.f2 


CACACAAAACAGAACCAGGACT 


22 


157 


CD44V3 


AJ251595v3 


S2998/CD44v3.r2 


CTG AAGTAG C ACTTCC G G ATT 


21 


157 


CD44V3 


AJ251595v3 


S4814/CD44v3.p2 


ACCCAGTGGAACCCAAGCCATTC 


23 


159 


CD44v6 


AJ251595v6 


S3003/CD44v6.f1 


CTCATACCAGCCATCCAATG 


20 


160 


CD44v6 


AJ251595v6 


S3004/CD44v6.r1 


TTG G GTTG AAG AAATC AGTCC 


21 


161 


CD44v6 


AJ251595v6 


S4815/CD44v6.p1 


CACCAAGCCCAGAGGACAGTTCCT 


24 


162 


CD68 


NM_001251 


S0067/CD68.f2 


TGGTTCCCAGCCCTGTGT 


18 


163 


CD68 


NMJ)01251 


S0069/CD68.r2 


CTCCTCCACCCTGGGTTGT 


19 


164 


CD68 


NM_001251 


S4734/CD68.p2 


CTCCAAGCCCAGATTCAGATTCGAGTCA 


28 


165 


CD82 


NM_002231 


S0684/CD82.f3 


GTGCAGGCTCAGGTGAAGTG 


20 


166 


CD82 


NMJ302231 


S0685/CD82.r3 


GACCTCAGGGCGATTCATGA 


20 


167 


CD82 • 


NMJD02231 


S4790/CD82.p3 


TCAGCTTCTACAACTGGACAGACAACGCTG 


30 


168 


CD9 


NMJ301769 


S0686/CD9.f1 


GGGCGTGGAACAGTTTATCT 


20 


168 


CD9 


NMJ)01769 


S0687/CD9.r1 


CACGGTGAAGGTTTCGAGT 


19 


170 


CD9 


NM_001769 


S4792/CD9.p1 


AGACATCTGCCCCAAGAAGGACGT 


24 


171 


CDC25B 


NMJD21874 


S1160/CDC25B.f1 


AAACGAGCAGTTTGCCATCAG 


21 


172 


CDC25B 


NM_021874 


S1161/CDC25B.r1 


GTTG GTG ATGTTC CG AAG C A 


20 


176 


CDC25B 


NM_021874 


S4842/CDC25B.p1 


CCTCACCGGCATAGACTGGAAGCG 


24 


174 


CEACAM6 


NMJ)02483 


S3197/CEACAM.f1 


CAC AG C CTC ACTTCTAAC CTTCTG 


24 


175 


CEACAM6 


NMJXJ2483 


S3198/CEACAM.M 


TTGAATGGCGTGGATTCAATAG 


22 


176 


CEACAM6 


NM_002483 


S3261/CEACAM.p1 


ACCCACCCACCACTGCCAAGCTC 


23 


177 


CGA 


NM_001275 


S3221/CGA.f3 


CTGAAGGAGCTCCAAGACCT 


20 


178 


CGA 


NM_001275 


S3222/CGAT3 


C AAAAC CG CTGTG I I I CTTC 


20 


179 


CGA 


NM_001275 


S3254/CGA.p3 


TGCTGATGTGCCCTCTCCTTGG 


22 


180 


Chk2 


NMJD07194 


S1434/Chk2.f3 


ATGTGGAACCCCCACCTACTT 


21 


181 


Chk2 


NMJ307194 


S1435/Chk2.r3 


CAGTCCACAGCACGGTTATACC 


22 


182 


Chk2 


NM_007194 


S4942/Chk2.p3 


AGTCCCAACAGAAACAAGAACTTCAGGCG 


29 


183 


cMet 


NM_000245 


S0082/cMet.f2 


G AC ATTTC C AGTC CTGC AGTC A 


22 


184 


cMet 


NMJ500245 


S0084/cMet.r2 


CTCCGATCGCACACATTTGT 


20 


185 


cMet 


NM_000245 


S4993/cMet.p2 


TGCCTCTCTGCCCCACCCTTTGT 


23 


186 


COX2 


NM_000963 


S0088/COX2'.f1 


TCTGCAGAGTTGGAAGCACTCTA 


23 


187 


COX2 


NM_000963 


S0090/COX2.M 


GCCGAGGC 1 1 1 1 CTACCAGAA 


21 


188 


COX2 


NM_000963 


S4995/COX2.p1 


CAGGATACAGCTCCACAGCATCGATGTC 


28 


189 


cripto 


NMJD03212 


S3117/cripto.f1 


GGGTCTGTGCCCCATGAC 


18 


190 
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cripto 


NM_003212 


S3118/cripto.r1 


TGACCGTGCCAGCATTTACA 


20 


191 


cripto 


NMJD03212 


S3237/cripto.p1 


CCTGGCTGCCCAAGAAGTGTTCCCT 


25 


192 


CTSL 


NMJ)01912 


S1303/CTSL.f2 


GGGAGGCTTATCTCACTGAGTGA 


23 


193 


CTSL 


NM_001912 


S1304/CTSL.r2 


CCATTGCAGCCTTCATTGC 


19 


194 


CTSL 


NMJ)01912 


S4899/CTSL.p2 


TTGAGGCCCAGAGCAGTCTACCAGATTCT 


29 


195 


DCR3 


NM_0 16434 


S1786/DCR3.f3 


GACCAAGGTCCTGGAATGTC 


20 


196 


DCR3 


NMJ) 16434 


S1787/DCR3.r3 


GTCTTCCCTGTACCCGTAGG 


20 


197 


DCR3 


NM_0 16434 


S4982/DCR3.p3 


CAGGATGCCATTCACCTTCTGCTG 


24 


198 


DIABLO 


NMJ) 19887 


S0808/DIABLO.f1 


CACAATGGCGGCTCTGAAG 


19 


199 


ni AE3I o 
UIAdLU 


iMivi u i aoo / 




ArAOAAAOACTGTCTGTACCTGAAGA 




200 


DIABLO 


NM_019887 


S4813/DIABLO.p1 


AAGTTACGCTGCGCGACAGCCAA 


23 


201 


DPYD 


NM_000110 


S0100/DPYDT2 


AGGACGCAAGGAGGGTTTG 


19 


202 


DPYD 


NM__000110 


S0102/DPYD.r2 


GATGTCCGCCGAGTCCTTACT 


21 


203 


DPYD 


NMJ300110 


S4998/DPYD.p2 


C AGTG C CTAC AGTCTC G AGTCTGC C AGTG 


29 


204 


DR5 


Nlvl_003842 


S25o1/DRo.f2 


U 1 O 1 oAoAUAb 1 I i UuA 1 bAU 1 


O/t 


one 


DR5 


NM_0 03842 


S2552/DR5.r2 


CGA1 GA(j(jCOCAAC 1 1 UU ! 


•1 Q 

iy 




DR5 

rrpi KM 


NM_0 03842 


S4979/DKo.p2 


CAvjAGI 1 bo 1 bOOO 1 1 1 bAU 1 Ob 






endothelin 


NM_001955 


S0774/EDN1 e.fl 


TGCCACCTGGACATCATTTG 


20 


208 


EDN1 












endotheltn 


NMJ301955 


S0775/EDN1 e.ii 


TGGACCTAGGGCTTCCAAGTC 


21 


209 


EDN1 
endothelin 


NM_001955 


S4806/EDN1 e.pl 


CACTCCCGAGCACGTTGTTCCGT 


23 


210 


EGFR 


NM_005228 


S0103/EGFRT2 


TGTCGATGGACTTCCAGAAC 


20 


211 


EGFR 


NMJ305228 


S0105/EGFR.r2 


ATTGGGACAGCTTGGATCA 


19 


212 


EGFR 


NM_005228 


S4999/EGFR.p2 


CACCTGGGCAGCTGCCAA 


18 


213 


EGFRd27 


EGFRd27 


S2484/EGFRd2.f2 


GAGTCGGGCTCTGGAGGAAAAG 


22 


214 


EGFRd27 


EGFRd27 


S2485/EGFRd2.r2 


CCACAGGCTCGGACGCAC 


'18 


215 


EGFRd27 


EGFRd27 


S4935/EGFRd2.p2 


AGCCGTGATCTGTCACCACATAATTACC 


28 


216 


EIF4E 


NM_001968 


S0106/EIF4E.f1 


GATCTAAGATGGCGACTGTCGAA 


23 


217 


EIF4E 


NM_001968 


S0108/EIF4E.M 


TTAGATTCCG Mil CTCCTCTTCTG 


25 


218 


EIF4E 


NM_001968 


S5000/EIF4E.p1 


ACCACCCCTACTCCTAATCCCCCGACT 


27 


219 


ErbB3 


NM_001982 


S0112/ErbB3.fi 


CG GTTATGTC ATG C C AG ATAC AC 


23 


220 


ErbB3 


NM_001982 


S0114/ErbB3.r1 


GAACTGAGACCCACTGAAGAAAGG 


24 


221 


ErbB3 


NM_001982 


S5002/ErbB3.p1 


CCTCAAAGGTACTCCCTCCTCCCGG 


25 


222 


ERBB4 


NM_005235 


S1231/ERBB4.f3 


TGGCTCTTAATCAGTTTCGTTACCT 


25 


223 


ERBB4 


NM_005235 


S1232/ERBB4.r3 


C AAG GC AT ATC G ATC CTC ATAAAGT 


25 


224 


ERBB4 


NMJ305235 


S4891/ERBB4.p3 


TGTCCCACGAATAATGCGTAAATTCTCCAG 


30 


225 


EREG 


NM_001432 


S0670/EREG.f1 


ATAACAAAGTGTAGCTCTGACATGAATG 


28 


226 


EREG 


NM_001432 


S0671/EREG.r1 


CACACCTGCAGTAG 1 1 1 1 GACTCA 


24 


227 


EREG 


NMJD01432 


S4772/EREG.p1 


TTGTTTGCATGGACAGTGCATCTATCTGGT 


30 


228 


ERK1 


Z11696 


S1560/ERK1.f3 


ACGGATCACAGTGGAGGAAG 


20 


229 


ERK1 


Z11696 


S1561/ERK1.r3 


CTCATCCGTCGGGTCATAGT 


20 


230 


ERK1 


Z11696 


S4882/ERK1.p3 


CGCTGGCTCACCCCTACCTG 


20 


231 


fas 


NM_000043 


,S0118/fas.f1 


GGATTGCTCAACAACCATGCT 


21 


232 


fas 


NM_000043 


S0120/fas.r1 


GGCATTAACAC 1 1 1 I GGACGATAA 


24 


233 


fas 


NM_000043 


S5003/fas.p1 


TCTGGACCCTCCTACCTCTGGTTCTTACGT 


30 


234 


FRP1 


NMJD03012 


S1804/FRP1.f3 


TTGGTACCTGTGGGTTAGCA 


20 


235 


FRP1 


NMJ303012 


S1805/FRP1.r3 


CACATCCAAATGCAAACTGG 


20 


236 


FRP1 


NMJD03012 


S4983/FRP1.p3 


TCCCCAGGGTAGAATTCAATCAGAGC 


26 


237 


GPC3 


NMJD04484 


S1835/GPC3.f1 


TGATGCGCCTGGAAACAGT 


19 


238 
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GPC3 


NMJ)04484 


S1836/GPC3.M 


CGAGGTTGTGAAAGGTGCTTATC " 


23 


239 


GPC3 


NM_004484 


S5036/GPC3.p1 


AGC AG G C AACTCC G AAG G ACAACG 


24 


240 


GRQ1 


NM_001511 


S0133K3RO1.f2 


CGAAAAGATGCTGAACAGTGACA 


23 


241 


GROI 


NM„001511 


S0135/GRO1.r2 


TC AG G AAC AG C C ACC AG TG A 


20 


242 


GRQ1 


NMJ)01511 


S5006/GRO1.p2 


CTTCCTCCTCCCTTCTGGTCAGTTGGAT 


28 


243 


GUS 


NM_0 00181 


S0139/GUS.f1 


CCCACTCAGTAGCCAAGTCA 


20 


244 


GUS 


NIVL000181 


S0141/GUS.M 


CACGCAGGTGGTATCAGTCT 


20 


245 


GUS 


NMJD00181 


S4740/GUS.p1 


TCAAGTAAACGGGCTG i I I I CCAAACA 


27 


246 


HB-EGF 


NMJ)01945 


S0662/HB-EGF.f1 


GACTCCTTCGTCCCCAGTTG 


20 


247 


HB-EGF 


NM_001945 


S0663/HB-EGF.r1 


TG GC ACTTG AAGG CTCTG GTA 


21 


248 


HB-EGF 


NM_001945 


S4787/HB-EGF.p1 


TTGGGCCTCCCATAATTGCTTTGCC 


25 


249 


HER2 


NM_004448 


S0142/HER2.f3 


CGGTGTGAGAAGTGCAGCAA 


20 


250 


HER2 


NM_004448 


S0144/HER2.r3 


CCTCTCGCAAGTGCTCCAT 


19 


251 


HER2 


NM_004448 


S4729/HER2.p3 


CCAGACCATAGCACACTCGGGCAC . 


24 


242 


HGF 


M29145 


S1327/HGF.f4 


CCGAAATCCAGATGATGATG 


20 


253 


HGF 


M29145 


S1328/HGF.r4 


' CCCAAGGAATGAGTGGATTT 


20 


254 


HGF 


M29145 


S4901/HGF.p4 


CTCATGGACCCTGGTGCTACACG 


23 


255 


ID1 


NM_002165 


S0820/ID1i1 


AGAACCGCAAGGTGAGCAA 


19 


256 


ID1 


NM_002165 


S0821/1D1.M 


TCCAACTGAAGGTCCCTGATG 


21 


257 


ID1 


NM_002165 


S4832/ID1.p1 


TGGAGATTCTCCAGCACGTCATCGAC 


26 


258 


IGF1R 


NM_000875 


S1249/IGF1R.f3 


GCATGGTAGCCGAAGATTTCA 


21 


259 


IGF1R 


NMJ)00875 


S1250/IGF1R.r3 


TTTCCGGTAATAGTCTGTCTCATAGATATC 


30 


260 


IGF1R 


NMJD00875 


S4895/IGF1R.p3 


CGCGTCATACCAAAATCTCCGA I I I I GA 


28 


261 


IGFBP3 


NM_000598 


S0157/IGFBP3.f3 


ACGCACCGGGTGTCTGA 


17 


262 


IGFBP3 


NM__000598 


S0159/iGFBP3.r3 


TGCCCTTTCTTGATGATGATTATC 


24 


263 


(GFBP3 


NMJ)00598 


S5011/IGFBP3.p3 


CCCAAGTTCCACCCCCTCCATTCA 


24 


264 


IRS1 


NMJD05544 


S1943/IRS1.f3 


CCACAGCTCACCTTCTGTCA 


20 


265 


IRS1 


NM_005544 


S1944/IRS1.r3 


CCTCAGTGCCAGTCTCTTCC 


20 


266 


IRS1 


NMJD05544 


S5050/lRS1.p3 


TCCATCCCAGCTCCAGCCAG 


20 


267 


ITGA3 


NM_002204 


S2347/ITGA3.f2 


CCATGATCCTCACTCTGCTG 


20 


268 


ITGA3 


NM_002204 


S2348/ITGA3.r2 


G AAG CTTTGTAGCCG GTG AT 


20 


269 


ITGA3 


NM_002204 


S4852/ITGA3.p2 


C ACTCC AG AC CTCG CTTAG C ATG G 


24 


270 


ITGB3 


NM_000212 


S3126/ITGB3.f1 


ACCGGGAGCCCTACATGAC 


19 


271 


ITGB3 


NM_000212 


S3127/ITGB3.r1 


CCTTAAGCTCTTTCACTGACTCAATCT 


27 


272 


ITGB3 


NMJ300212 


S3243/ITGB3.p1 


AAATACCTGCAACCGTTACTGCCGTGAC 


28 


273 


KRT17 


NM__000422 


S0172/KRT17.f2 


CGAGGATTGGTTCTTCAGCAA 


21 


274 


KRT17 


NMJ300422 


S0174/KRT17.r2 


ACTCTGCACCAGCTCACTGTTG 


22 


275 


' KRT17 


NM_000422 


S5013/KRT17.p2 


CACCTCGCGGTTCAGTTCCTCTGT 


24 


276 


LAMC2 


NM_005562 


S2826/LAMC2.f2 


ACTCAAGCGGAAATTGAAGCA 


21 


277 


LAMC2 


NM_005562 


S2827/LAMC2.r2 


ACTC CCTG AAG C C GAG AC ACT 


21 


278 


LAMC2 


NM_005562 


S4969/LAMC2.p2 


AGGTCTTATCAGCACAGTCTCCGCCTCC 


28 


278 


MTA1 


NM_004689 


S2369/MTA1.f1 - 


CCGCCCTCACCTGAAGAGA 


19 


280 


MTA1 


NM_004689 


S2370/MTA1.M 


G G AATAAGTT AG C C G C GCTTCT 


22 


281 


MTA1 


NM_004689 


S4855/MTA1.p1 


CCCAGTGTCCGCCAAGGAGCG 


21 


282 


NMYC 


NMJD05378 


S2884/NMYC.f2 


TGAGCGTCGCAGAAACCA 


18 


283 


NMYC 


NM_005378 


S2885/NMYC.r2 


TCCCTGAGCGTGAGAAAGCT 


20 


284 


NMYC 


NM_005378 


S4976/NMYC.p2 


CCAGCGCCGCAACGACCTTC 


20 


285 


p14ARF 


NM_000077 


S0199/p14ARF.f3 


GCGGAAGGTCCCTCAGACA 


19 


286 


p14ARF 


NMJ300077 


S0201/p14ARF.r3 


TCTAAGTTTCCCGAGGTTTCTCA 


23 


297 


p14ARF 


NM_000077 


S5068/p14ARF.p3 


CCCCGATTGAAAGAACCAGAGAGGCT 


26 


288 
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p27 


NM_004064 


S0205/p27.f3 


CGGTGGACCACGAAGAGTTAA 


21 


289 


p27 


NM_004064 


S0207/p27.r3 


GG CTC G CCTCTTC C ATGTC 


19 


290 


p27 


NM_004064 


S4750/p27.p3 


C CG GG ACTTG G AG AAG C ACTG C A 


23 


291 


P53 


NMJD00546 


S0208/P53.f2 


CTTTGAACCCTTGCTTGCAA 


20 


292 


P53 


NMJD00546 


S0210/P53.r2 


CCCGGGACAAAGCAAATG 


18 


293 


P53 


NM_000546 


S5065/P53.p2 


AAGTCCTGGGTGCTTCTGACGCACA 


25 


294 


PA11 


NMJ300602 


S0211/PAI1.f3 


CCGCAACGTGG I I I I CTCA 


19 


295 


PAI1 


NM_000602 


S0213/PAI1.r3 


TGCTGGGTTTCTCCTCCTGTT 


21 


296 


PAI1 


NM_000602 


S5066/PAl1.p3 


CTCGGTGTTGGCCATGCTCCAG 


22 


297 


PDGFA 


NM_002607 


S0214/PDGFA.f3 


TTGTTGGTGTGCCCTGGTG 


19 


298 


PDGFA 


NM_002607 


S0216/PDGFA.r3 


TGGGTTCTGTCCAAACACTGG 


21 


299 


PDGFA 


NM_002607 


S5067/PDGFA.p3 


TGGTGGCGGTCACTCCCTCTGC 


22 


300 


PDGFB 


NM_002608 


S0217/PDGFB.f3 


ACTGAAGGAGACCCTTGGAG 


20 


301 


PDGFB 


NM_002608 


S0219/PDGFB.r3 


TAAATAACC CTG C C C AC AC A 


20 


302 


PDGFB 


NM_002608 


S5014/PDGFB.p3 


TCTCCTGCCGATGCCCCTAGG 


21 


303 


PGK1 


NMJD00291 


S0232/PGK1.f1 


AGAGCC AGTTGCTGTAGAACTCAA 


24 


304 


PGK1 


NM_000291 


S0234/PGK1.M 


CTGGGCCTACACAGTCCTTCA 


21 


305 


PGK1 


NM_000291 


S5022/PGK1.p1 


TCTCTGCTGGGCAAGGATGTTCTGTTC 


27 


306 


PLAUR 


NM_002659 


S1976/PLAUR.f3 


CCCATGGATGCTCCTCTGAA 


20 


307 


PLAUR 


NMJ)02659 


S1977/PLAUR.r3 


CCGGTGGCTACCAGACATTG 


20 


308 


PLAUR 


NM_002659 


S5054/PLAUR.p3 


CATTGACTGCCGAGGCCCCATG 


22 


309 


PPARG 


NM_005037 


S3090/PPARG.f3 


TGACTTTATGGAGCCCAAGTT 


21 


310 


PPARG 


NMJ)05037 


S3091/PPARG.r3 


GCCAAGTCGCTGTCATCTAA 


20 


311 


PPARG 


NM_005037 


S4824/PPARG.p3 


TTC C AGTGC ATTG AACTTC AC AG C A 


25 


312 


PTPD1 


NMJD07039 


S3069/PTPD1.f2 


CGCTTGCCTAACTCATACTTTCC 


23 


313 


PTPD1 


NMJD07039 


S3070/PTPD1.r2 


CCATTCAGACTGCGCCACTT 


20 


314 


PTPD1 


NMJD07039 


S4822/PTPD1.p2 


TCCACGCAGCGTGGCACTG 


19 


315 


RANBP2 


NM_006267 


S3081/RANBP2.f3 


TCCTTCAGCTTTCACACTGG 


20 


316 


RANBP2 


NMJ306267 


S3082/RANBP2.r3 


AAATCCTGTTCCCACCTGAC 


20 


317 


RANBP2 


NM_006267 


S4823/RANBP2.p3 


TCCAGAAGAGTCATGCAACTTCATTTCTG 


29 


318 


RASSF1 


NMJ307182 


S2393/RASSF1.f3 


AGTGGGAGACACCTGACCTT 


20 


319 


RASSF1 


NM_007182 


S2394/RASSF1.r3 


TGATCTGGGCATTGTACTCC 


20 


320 


RASSF1 


NMJD07182 


S4909/RASSF1.p3 


TTGATCTTCTGCTCAATCTCAGCTTGAGA 


29 


321 


RB1 


NM_000321 


S2700/RB1.ff 


CGAAGCCCTTACAAGTTTCC 


20 


322 


RB1 


NMJD00321 


S2701/RB1.r1 


GGACTCTTCAGGGGTGAAAT 


20 


323 


RB1 


NMJ)00321 


S4765/RB1.p1 


CCCTTACGGATTCCTGGAGGGAAC 


24 


324 


RIZ1 


NMJD12231 


S1320/RIZ1.f2 


C C AG AC GAG C G ATTAG AAG C 


20 


325 


RIZ1 


NM_012231 


S1321/RlZ1.r2 


TCCTCCTCTTCCTCCTCCTC 


20 


326 


RIZ1 


NMJD12231 


S4761/RIZ1.p2 


TGTGAGGTGAATGATTTGGGGGA 


23 


327 


RPLPO 


NM_001002 


S0256/RPLPO.f2 


C C ATTCTATCATC AACG G GTACAA 


24 


328 


RPLPO 


NM_001002 


. S0258/RPLPO.r2 


TCAGCAAGTGGGAAGGTGTAATC 


23 


329 


RPLPO 


NM_001002 


S4744/RPLPO.p2 


TCTCCACAGACAAGGCCAGGACTCG 


25 


330 


SPRY2 


NM_005842 


S2985/SPRY2.f2 


TGTGGCAAGTGCAAATGTAA 


20 


331 


SPRY2 


NM_005842 


S2986/SPRY2.r2 


GTCGCAGATCCAGTCTGATG 


20 


332 


SPRY2 


NM_005842 


S4811/SPRY2.p2 


' CAGAGGCCTTGGGTAGGTGCACTC 


24 


333 


Src 


NM_004383 


S1820/Src.f2 


CCTGAACATGAAGGAGCTGA 


20 


334 


Src 


NM_004383 


S182l/Src.r2 


CATC AC GTCTC C G AACTC C 


19 


335 


Src 


NMJ304383 


S5034/Src.p2 


TCCCGATGGTCTGCAGCAGCT 


21 


336 


STK15 


NM__003600 


S0794/STK15.f2 


CATCTTCCAGGAGGACCACT 


20 


337 


STK15 


NMJJ03600 


S0795/STK15.r2 


TCC G AC CTTC AATC ATTTC A 


20 


338 
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STK15 


NM_003600 


S4745/STK15.p2 


CTCTGTGGCACCCTGGACTACCTG 


24 


339 


SURV 


NM_001168 


S0259/SURV,f2 


TGI I I I GATTCCCGGGCTTA 


20 


340 


SURV 


NM_001168 


S0261/SURV.r2 


CAAAGCTGTCAGCTCTAGCAAAAG 


24 


341 


SURV 


NM_001168 


S4747/SURV.p2 


TGCCTTCTTCCTCCCTCACTTCTCACCT 


28 


342 


TERC 


U86046 


S2709/TERC.f2 


AAGAGGAACGGAGCGAGTC 


19 


343 


TERC 


U86046 


S2710/TERC.r2 


ATGTGTGAGCCGAGTCCTG 


19 


344 


TERC 


U86046 


S4958/TERC.p2 


CACGTCCCACAGCTCAGGGAATC 


23 


345 


TFRC 


NM_003234 


S1352/TFRC.f3 


GCCAACTGCTTTCATTTGTG 


20 


346 


TFRC 


NM__003234 


S1353/TFRC.r3 


ACTCAGGCCCA I I I CCTTTA 


20 


347 


TFRC 


NM_003234 


S4748/TFRC.p3 


AGGGATCTGAACCAATACAGAGCAGACA . 


28 


348 


TGFBR2 


NM_003242 


S2422/TGFBR2.f3 


AAC AC C AATG G GTTCC ATCT 


20 


349 


TGFBR2 


NMJD03242 


S2423/TGFBR2.r3 


CCTCTTC ATC AG GC C AAACT 


20 


350 


TGFBR2 


NM_003242 


S4913/TGFBR2.p3 


TTCTGGGCTCCTGATTGCTCAAGC 


24 


351 


TIMP2 


NM_003255 


S1680/TIMP2.f1 


TCACCCTCTGTGACTTCATCGT 


22 


352 


TIMP2 


NM_003255 


S1681/TIMP2.M 


TGTGGTTCAGGCTCTTCTTCTG 


22 


353 


TIMP2 


NM_003255 


S4916/TIMP2,p1 


CCCTGGGACACCCTGAGCACCA 


22 


354 


TITF1 


NM_003317 


S2224/TlTF1.f1 


CGACTCCGTTCTGAGTGTCTGA 


22 


355 


TITF1 


NMJ303317 


S2225/TITF1.r1 


CCCTGCATGCCCACTTTCT 


19 


356 


TITF1 


NM_003317 


S4829/TITF1.p1 


ATCTTGAGTCCCCTGGAGGAAAGC 


24 


357 


TP53BP1 


NM_005657 


S1747/TP53BPT2 


TGCTGTTGCTGAGTCTGTTG 


20 


358 


TP53BP1 


NM_005657 


S1748/TP53BP.r2 


CTTGCCTGGCTTCACAGATA 


20 


359 


TP53BP1 


NM_005657 


S4924/TP53BP.p2 


CCAGTCCCCAGAAGACCATGTCTG 


24 


360 


upa 


NMJJ02658 


S0283/upa.f3 


GTGGATGTGCCCTGAAGGA 


19 


361 


upa 


NM__002658 


S0285/upa.r3 


CTGCGGATCCAGGGTAAGAA 


20 


362 


upa 


NMJ)02658 


S4769/upa.p3 


AAGCCAGGCGTCTACACGAGAGTCTCAC 


28 


363 


VEGFC 


NM_005429 


S2251A/EGFC.f1 


C CTC AG C AAG ACGTT ATTTG AAATT 


25 


364 


VEGFC 


NMJ305429 


S2252A/EGFC.M 


AAGTGTGATTGGCAAAACTGATTG 


24 


365 


VEGFC 


NM_005429 


S4758A/EGFC.p1 


CCTCTCTCTCAAGGCCCCAAACCAGT 


26 


366 


XIAP 


NM_001167 


S0289/XIAPJ1 


GCAGTTGGAAGACACAGGAAAGT 


23 


367 


XIAP 


NM_001167 


S0291/XIAP.r1 


TGCGTGGCACTA I I I ICAAGA 


21 


368 


XIAP 


NMJD01167 


S4752/XIAP.p1 


TCCCCAAATTGCAGA I I I ATCAACGGC 


27 


369 


YB-1 


NMJD04559 


S1194/YB-1.f2 


AGACTGTGGAGTTTGATGTTGTTGA 


- 25 


370 


YB-1 


NM_004559 


S1195/YB-1.r2 


GGAACACCACCAGGACCTGTAA 


22 


371 


YB-1 


NMJ304559 


S4843/YB-1 .p2 


TTGCTGCCTCCGCACCC I I I I CT 


23 


372 
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WHAT IS CLAIMED IS: 



1 1 . A prognostic method comprising: 

2 (a) subjecting a sample comprising EGFR-expressing cancer cells obtained 

3 from a patient to quantitative analysis of the expression level of the RNA transcript of at 

4 least one gene selected from the group consisting of CD44v3; CD44v6; DR5; GROl; 

5 KRT1 7; and LAMC2, or their product, and 

6 (b) identifying the patient as likely to show resistance to treatment with an 

7 EGFR-inhibitor if the normalized expression levels of said gene or genes, or their 

8 products, are elevated above a defined expression threshold. 

1 2. The method of claim 1 wherein the patient is identified as likely to show 

2 resistance to treatment with an EGFR-inhibitor if the expression level of the RNA 

3 transcript of LAMC2 is elevated above a defined expression threshold. 

1 3. The method of claim 1 wherein the levels of the RNA transcripts of said 

2 genes are normalized relative to the mean level of the RNA transcript or the product of 

3 two or more housekeeping genes. 

1 4. The method of claim 3 wherein the housekeeping genes are selected from 

2 the group consisting of glyceraldehyde-3 -phosphate dehydrogenase (GAPDH), Cypl, 

3 albumin, actins, tubulins, cyclophilin hypoxantine phosphoribosyltransferase (HRPT), 

4 L32,28S,andl8S. 

1 5. The method of claim 3 wherein the sample is subjected to global gene 

2 expression analysis of all genes present above the limit of detection. 

1 6. The method of claim 5 wherein the levels of the RNA transcripts of said 

2 genes are normalized relative to the mean signal of the RNA transcripts or the products of 

3 all assayed genes or a subset thereof. 

1 7. The method of claim 6 wherein the level of RNA transcripts is determined 

2 by quantitative RT-PCR (qRT-PCR), and the signal is a Ct value. 
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1 8. The method of claim 7 wherein the assayed genes include at least 50 

2 cancer related genes, 

1 9. The method of claim 7 wherein the. assayed genes includes at least 1 00 

2 cancer related genes. 

1 10. The method of claim 1 wherein said patient is human. 

1 11. The method of claim 10 wherein said sample is a fixed, paraffin-embedded 

2 tissue (FPET) sample, or fresh or frozen tissue sample. 

1 12. The method of claim 10 wherein said sample is a tissue sample from fine 

2 needle, core, or other types of biopsy. 

1 13. The method of claim 10 wherein said quantitative analysis is perfonned by 

2 qRT-PCR. 

1 14. The method of claim 10 wherein said quantitative analysis is performed by 

2 quantifying the products of said genes. 

1 15. The method of claim 1 4 wherein said products are quantified by 

2 immunohistochemistry or by proteomics technology. 

1 16. The method of claim 10 wherein the EGFR-expressing cancer is selected 

2 from the group consisting of head and neck cancer, colon cancer, breast cancer, ovarian 

3 cancer, pancreatic cancer, and non-small cell lung carcinoma. 

1 17. The method of claim 16 wherein said cancer is head and neck cancer or 

2 colon cancer. 
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1 18. The method of claim 1 0 further comprising the step of preparing a report 

2 comprising a statement whether the patient is likely to respond well to treatment with an 

3 EGFR inhibitor. 

1 19. The method of claim 10 further comprising the step of preparing a report 

2 comprising a statement whether the patient is likely to show resistance to treatment with 

3 an EGFR inhibitor. 

1 20. A method for predicting the likelihood that a patient diagnosed with an 

2 EGFR -expressing head or neck cancer will respond to treatment with an EGFR inhibitor, 

3 comprising determining the normalized level of one or more prognostic RNA transcripts 

4 or their products in a sample comprising EGFR-expressing cancer cells obtained from 

5 said patient, wherein the prognostic transcript is the transcript of one or more genes 

6 selected from the group consisting of: CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; 

7 p27; P53; RBI; TIMP2; YB-1; A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; 

8 CCNA2; CCNE1; CCNE2; CD105; CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; 

9 COX2; cripto; DCR3; DIABLO; DPYD; DR5; EDN1 endothelin; EGFR; EIF4E; 

10 ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; 

11 LAMC2; MTA1; NMYC; PAI1; PDGFA; PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; 

12 and VEGFC, wherein expression of one or more of A-Catenin; AKT1 ; AKT2; APC; Bax; 

13 B-Catenin; BTC; CCNA2; CCNE1; CCNE2; CD105; CD44v3; CD44v6; CD68; 

14 CEACAM6; Chk2; cMet; COX2; cripto; DCR3; DIABLO; DPYD; DR5; EDN1 

15 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; HB-EGF; HER2; IGF1R; 

16 IRS1; ITGA3; KRT17; LAMC2; MTA1; NMYC; PAI1; PDGFA; PGK1; PTPD1; 

17 RANBP2; SPRY2; TP53BP1 ; and VEGFC, or the corresponding gene product, above a 

1 8 defined threshold expression level indicates that the patient is likely to show resistance to 

19 treatment with an EGFR inhibitor, and expression of one or more of CD44s; CD82; CGA; 

20 CTSL; EGFRd27; IGFBP3 ; p27; P53; RBI; TIMP2; and YB- 1 , or the corresponding 

21 gene product, above a defined threshold expression level indicates that the patient is 

22 likely to respond well to treatment with an EGFR inhibitor. 

1 21 . The method of claim 20 comprising determining the normalized levels of 

2 at least two of said prognostic transcripts or their expression products. 
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1 22. The method of claim 20 comprising determining the normalized levels of 

2 at least 5 of said prognostic transcripts or their expression products. 

1 23 . The method of claim 20 comprising determining the normalized levels of 

2 all of said prognostic transcripts or their expression products. 

1 24. The method of claim 20 wherein said sample is a tissue sample. 

1 25. The method of claim 24 wherein said tissue is fixed, paraffin-embedded, 

2 or fresh, or frozen. 

1 26. The method of claim 24 wherein said tissue is from fine needle, core, or 

2 other types of biopsy. 

1 27. The method of claim 20 further comprising the step of preparing a report 

2 comprising a statement whether said patient is likely to respond well to treatment with an 

3 EGFR inhibitor. 

1 28. The method of claim 20 further comprising the step of preparing a report 

2 comprising a statement whether said patient is likely to show resistance to treatment with 

3 an EGFR inhibitor. 

1 29. A method comprising treating a patient diagnosed with an EGFR- 

2 expressing head or neck cancer and determined to have elevated normalized expression of 

3 one or more of the RNA transcripts of CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; 

4 p27; P53; RBI; TIMP2; and YB-1 genes, or the corresponding gene products in said 

5 cancer, or decreased normalized expression of one or more of the RNA transcripts of A- 

6 Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CCNA2; CCNE1; CCNE2; CD105; 

7 CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; DIABLO; 

8 DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; GROl; HB- 

9 EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; NMYC; PAI1; PDGFA; 
10 PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; and VEGFC genes, or the corresponding 
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1 1 gene products in said cancer, with an effective amount of an EGFR-inhibitor, wherein, for 

12 each gene, elevated or decreased normalized expression is defined by a defined 

13 expression - threshold value. 

1 30. The method of claim 29 wherein said patient has been determined to have 

2 elevated or decreased normalized expression of all of said RNA transcripts or the 

3 corresponding gene products. 

1 3 1 . A method for predicting the likelihood that a patient diagnosed with an 

2 EGFR -expressing colon cancer will respond to treatment with an EGFR inhibitor, 

3 comprising determining the normalized level of one or more prognostic RNA transcripts 

4 .. or their products in a sample comprising EGFR-expressing cancer cells obtained from 

5 said patient, wherein the prognostic transcript is the transcript of one or more genes 

6 selected from the group consisting of: Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CCNE1; 

7 CCNE2; CD105; CD9; COX2; DIABLO; ErbB3; EREG; FRP1; GPC3; GUS; HER2; 

8 HGF; ID1; ITGB3; PTPD1; RPLPO; STK15; SURV; TERC; TGFBR2; TITF1; XIAP; 

9 CA9; CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; DR5; GROl; KRT17; 

10 LAMC2; P14ARF; PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; TFRC; and UPA, 

1 1 wherein the normalized level of one or more of CA9; CD 134; CD44E; CD44v3; CD44v6; 

12 CDC25B; CGA; DR5; GROl; KRT17; LAMC2; P14ARF; PDGFB; PLAUR; PPARG; 

1 3 RAS SF 1 ; RIZ1 ; Src; TFRC; and UPA, or the corresponding gene product, when above a 

14 defined expression threshold value, indicates that the patient is likely to show resistance 

15 to treatment with an EGFR inhibitor, and the normalized level of one or more of Bak; 

16 Bclx; BRAF; BRK; Cadl7; CCND3; CCNE1; CCNE2; CD105; CD9; COX2; DIABLO; 

17 ErbB3; EREG; FRP1; GPC3; GUS; HER2; HGF; ID1; ITGB3; PTPD1; RPLPO; STK15; 

1 8 SURV; TERC; TGFBR2; TITF1 ; and XIAP, or the corresponding gene product, when 

19 above a defined expression threshold value, indicates that the patient is likely to respond 

20 well to treatment with an EGFR inhibitor. 

1 32. The method of claim 31 comprising determining the normalized levels of 

2 at least two of said prognostic transcripts or their expression products. 
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1 33. The method of claim 3 1 comprising determining the normalized levels of 

2 at least 5 of said prognostic transcripts or their expression products. 

1 34. The method of claim 3 1 comprising determining the normalized levels of 

2 all of said prognostic transcripts or their expression products. 

1 35. The method of claim 3 1 wherein said sample is a tissue sample. 

1 36. The method of claim 35 wherein the tissue is fixed, paraffin-embedded, or 

2 fresh, or frozen. 

1 37. The method of claim 35 wherein the tissue is from fine needle, core, or 

2 other types of biopsy. 

1 38. The method of claim 3 1 further comprising the step of preparing a report 

2 comprising a statement whether the patient is likely to respond well to treatment with an 

3 EGFR inhibitor. 

1 39. The method of claim 3 1 further comprising the step of preparing a report 

2 comprising a statement whether the patient is likely to show resistance to treatment with 

3 an EGFR inhibitor. 

1 40. A method comprising treating a patient diagnosed with an EGFR- 

2 expressing colon cancer and determined to have elevated normalized expression of one or 

3 more of the RNA transcripts of Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CCNE1 ; 

4 CCNE2; CD105; CD9; COX2; DIABLO; ErbB3; EREG; FRP1; GPC3; GUS; HER2; 

5 HGF; EDI; ITGB3; PTPD1; RPLPO; STK15; SURV; TERC; TGFBR2; TITF1; and 

6 XIAP genes, or the corresponding gene products in said cancer, or decreased normalized 

7 expression of one or more of the RNA transcripts of CA9; CD 1 34; CD44E; CD44v3 ; 

8 CD44v6; CDC25B; CGA; DR5; GROl; KRT17; LAMC2; P14ARF; PDGFB; PLAUR; 

9 PPARG; RASSF1; RIZ1; Src; TFRC; and UPA genes, or the corresponding gene 

10 products, with an effective amount of an EGFR-inhibitor, wherein for each gene elevated 
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11 or decreased normalized expression is determined relative to a defined expression 

12 threshold. 

1 41 . An array comprising polynucleotides hybridizing to the following genes: 

2 Bak; Bclx; BRAF; BRK; Cadl7; CCND3; CD105; CD44s; CD82; CD9; CGA;; CTSL; 

3 EGFRd27; ErbB3; EREG; GPC3; GUS; HGF; ID1; IGFBP3; ITGB3; ITGB3; p27; P53; 

4 PTPD1; RBI; RPLPO; STK15; SURV; TERC; TGFBR2; TIMP2; TITF1; XIAP; YB-1; 

5 A-Catenin; AKT1 ; AKT2; APC; Bax; B-Catenin; BTC; CA9; CCNA2; CCNE1 ; CCNE2; 

6 CD134; CD44E; CD44v3; CD44v6; CD68; CDC25B; CEACAM6; Chk2; cMet; COX2; 

7 cripto; DCR3; DIABLO; DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; 

8 fas; FRP1; GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1; 

9 NMYC; P14ARF; PAI1; PDGFA; PDGFB; PGK1; PLAUR; PPARG; RANBP2; 

10 RASSF1; RIZ1; SPRY2; Src; TFRC; TP53BP1; upa; and VEGFC, immobilized on a 



1 1 solid surface. 

1 42. The array of claim 41 wherein said polynucleotides are cDNAs. 

1 43. The array of claim 42 wherein said cDNAs are about 500 to about 5000 

2 bases. 

1 44. The array of claim 41 wherein said polynucleotides are oligonucleotides. 

1 45. The array of claim 44 wherein said oligonucleotides are about 20 to 80 

2 bases long. 

1 46. The array of claim 45 which comprises about 330,000 oligonucleotides. 

1 47. The array of claim 41 wherein said solid surface is glass. 

1 48. An array comprising polynucleotides hybridizing to the following genes: 

2 CD44v3; CD44v6; DR5; GROl; KRT17; LAMC2. 
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49. An array comprising polynucleotides hybridizing to the following genes: 
A-Catenin; AKT1; AKT2; APC; Bax; B-Catenin; BTC; CCNA2; CCNE1; CCNE2; 
CD105; CD44v3; CD44v6; CD68; CEACAM6; Chk2; cMet; COX2; cripto; DCR3; 
DIABLO: DP YD; DR5; EDN1 endothelin; EGFR; EIF4E; ERBB4; ERK1; fas; FRP1; 
GROl; HB-EGF; HER2; IGF1R; IRS1; ITGA3; KRT17; LAMC2; MTA1 ; NMYC; 
PAI1; PDGFA; PGK1; PTPD1; RANBP2; SPRY2; TP53BP1; VEGFC; CD44s; CD82; 
CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; TIMP2; and YB-L 

50. An array comprising polynucleotides hybridizing to the following genes: 
CA9; CD134; CD44E; CD44v3; CD44v6; CDC25B; CGA; DR5; GROl; KRT17; 
LAMC2; P14ARF; PDGFB; PLAUR; PPARG; RASSF1; RIZ1; Src; TFRC; UPA; 
CD44s; CD82; CGA; CTSL; EGFRd27; IGFBP3; p27; P53; RBI; TIMP2; and YB-1. 



5 1 . The method of any one of claims 1 , 20 and 3 1 , wherein RNA is isolated 
from said tissue by a procedure comprising: 

3 (a) incubating a section of said fixed, paraffin-embedded tissue specimen at a 

4 temperature of about 56 °C to 70 °C in a lysis buffer, in the presence of a protease, 

5 without prior dewaxing, to form a lysis solution; 

6 (b) cooling the lysis solution to a temperature where the wax solidifies; and 

7 (c) isolating the nucleic acid from said lysis solution. 

1 52. A kit comprising one or more of (1) extraction buffer/reagents and 

2 protocol; (2) reverse transcription buffer/reagents and protocol; and (3) qPCR 

3 buffer/reagents and protocol suitable for performing the method of any one of claims 1 5 

4 20 and 30. 

1 53. A method for amplification of a gene listed in Tables 5 A and 5B by 

2 polymerase chain reaction (PGR), comprising performing said PCR by using an amplicon 

3 listed in Table 5 A and 5B and a corresponding primer-probe set listed in Tables 6A-6F. 

1 54. A PCR primer-probe set listed in Tables 6A-6F. 

1 55 . A PCR amplicon listed in Tables 5 A and 5 
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SEQUENCE LISTING 

<110> Genomic Health 

vail d' Hebron university Hostipal 
Baker, Joffre 
cronin, Maureen 
Shak, Steve 
Basel ga, Jose 

<120> GENE EXPRESSION PROFILING OF EGFR 
POSITIVE CANCER 

<130> 39740-0005 

<140> unassigned 
<141> 2003-11-15 

<150> 60/427090 
<151> 2003-11-15 

<160> 372 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 78 
<212> DNA 

<213> Homo sapiens 
<400> 1 

cgttccgatc ctctatactg catcccaggc atgcctacag caccctgatg tcgcagccta 60 
taaggccaac agggacct 78 

<210> 2 
<211> 71 
<212> DNA 

<213> Homo sapiens 
<400> 2 

cgcttctatg gcgctgagat tgtgtcagcc ctggactacc tgcactcgga gaagaacgtg 60 
gtgtaccggg a 71 

<210> 3 
<211> 71 
<212> DNA 

<213> Homo sapiens 
<400> 3 

tcctgccacc cttcaaacct caggtcacgt ccgaggtcga cacaaggtac ttcgatgatg 60 
aatttaccgc c 71 

<210> 4 
<211> 69 
<212> DNA 

<213> Homo sapiens 
<400> 4 

ggacagcagg aatgtgtttc tccatacagg tcacggggag ccaatggttc agaaacaaat 60 
cgagtgggt 69 

<210> 5 
<211> 80 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ggctcttgtg cgtactgtcc ttcgggctgg tgacagggaa gacatcactg agcctgccat 60 
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ctgtgctctt 



cgtcatctga 



80 



<210> 6 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 6 

ccattcccac cattctacct gaggccagga cgtctggggt gtggggattg gtgggtctat 60 
gttccc 66 

<210> 7 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ccgccgtgga cacagactcc ccccgagagg tctttttccg agtggcagct gacatgtttt 60 
ctgacggcaa 70 

<210> 8 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cttttgtgga actctatggg aacaatgcag cagccgagag ccgaaagggc caggaacgct 60 
tcaaccgctg 70 

<210> 9 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ccttccgacc agcagatgaa gatcatcgaa atcaatttgg gcaacgagac cgatcctcat 60 
cagctcccaa tgtgcatata aa 82 

<210> 10 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gtgcaggaaa ggttcacaaa tgtggagtgt ctgcgtccaa tacacgcgtg tgctcctctc 60 



<210> 11 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 11 

agggagatgc cgcttcgtgg tggccgagca gacgccctcc tgtgtctgtg atgaaggcta 60 
cattggagca aggtgtgaga g 81 

<210> 12 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 12 

atcctagccc tggtttttgg cctccttttt gctgtcacca gcgtcgcgtt ccttgtgcag 60 
atgagaaggc ag 72 

<210> 13 
<211> 77 
<212> DNA 



cttactccat 



cgtgtgtgc 



79 
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<213> Homo sapiens 



<400> 13 

gaaggccaag aaccgagtca aattatattc cagtttaagg ccaatcctcc tgctgtgact 60 
tttgaactaa ctgggga 77 

<210> 14 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 14 

ccatacctca agtatttgcc atcagttatt gctggagctg cctttcattt agcactctac 60 
acagtcacgg gacaaagct 79 

<210> 15 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cctctgtgct acagattata cctttgccat gtacccgcca tccatgatcg ccacgggcag 60 
cattggggct gcagtg 76 

<210> 16 

<2U> 71 

<212> DNA 

<213> Homo sapiens 

<400> 16 

aaagaagatg atgaccgggt ttacccaaac tcaacgtgca agcctcggat tattgcacca 60 
tccagaggct c 71 

<210> 17 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 17 

atgctgtggc tccttcctaa ctggggcttt cttgacatgt aggttgcttg gtaataacct 60 
ttttgtatat cacaatttgg gt 82 

<210> 18 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gcaggtgtca gcaagtatga tcagcaatga ggcggtggtc aatatcctgt cgagctcatc 60 
accacagcgg aaaaa 75 

<210> 19 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gcccagtgcg gagaacaggt ccagcttgat tctcgtctct gcacttaagc tgttctccag 60 
gtgcgtgtga tt 72 

<210> 20 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 20 

atcaccgaca gcacagacag aatccctgct accaatatgg actccagtca tagtacaacg 60 
cttcagccta ctgcaaatcc aaacacaggt 90 
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<210> 21 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac cagagaccaa 60 
gacacattcc accccagt 78 

<210> 22 

<211> 69 

<212> DNA 

<213> Homo sapiens 

<400> 22 

cacacaaaac agaaccagga ctggacccag tggaacccaa gccattcaaa tccggaagtg 60 
ctacttcag 69 

<210> 23 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 23 

ctcataccag ccatccaatg caaggaagga caacaccaag cccagaggac agttcctgga 60 
ctgatttctt caacccaa 78 

<210> 24 

<211> 74 

<212> DNA 

<213> Homo sapiens 

<400> 24 

tggttcccag ccctgtgtcc acctccaagc ccagattcag attcgagtca tgtacacaac 60 
ccagggtgga ggag 74 

<210> 25 

<211> 84 

<212> DNA 

<213> Homo sapiens 

<400> 25 

gtgcaggctc aggtgaagtg ctgcggctgg gtcagcttct acaactggac agacaacgct 60 
gagctcatga atcgccctga ggtc 84 

<210> 26 

<211> 64 

<212> DNA 

<213> Homo sapiens 

<400> 26 

gggcgtggaa cagtttatct cagacatctg ccccaagaag gacgtactcg aaaccttcac 60 
cgtg 64 

<210> 27 

<211> 85 

<212> DNA 

<213> Homo sapiens 

<400> 27 

aaacgagcag tttgccatca gacgcttcca gtctatgccg gtgaggctgc tgggccacag 60 
ccccgtgctt cggaacatca ccaac 85 

<210> 28 

<211> 72 

<212> DNA 

<213> Homo sapiens 
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<400> 28 

cacagcctca cttctaacct tctggaaccc acccaccact gccaagctca ctattgaatc 60 
cacgccattc aa 72 

<210> 29 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ctgaaggagc tccaagacct cgctctccaa ggcgccaagg agagggcaca tcagcagaag 60 
aaacacagcg gttttg 76 

<210> 30 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 30 

atgtggaacc cccacctact tggcgcctga agttcttgtt tctgttggga ctgctgggta 60 
taaccgtgct gtggactg " 78 

<210> 31 

<211> 86 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gacatttcca gtcctgcagt caatgcctct ctgccccacc ctttgttcag tgtggctggt 60 
gccacgacaa atgtgtgcga tcggag 86 

<210> 32 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 32 

tctgcagagt tggaagcact ctatggtgac atcgatgctg tggagctgta tcctgccctt 60 
ctggtagaaa agcctcggc 79 

<210> 33 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gggtctgtgc cccatgacac ctggctgccc aagaagtgtt ccctgtgtaa atgctggcac 60 
ggtca * 65 

<210> 34 

<211> 74 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gggaggctta tctcactgag tgagcagaat ctggtagact gctctgggcc tcaaggcaat 60 
gaaggctgca atgg 74 

<210> 35 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 35 

gaccaaggtc ctggaatgtc tgcagcagaa ggtgaatggc atcctggaga gccctacggg 60 
tacagggaag ac 72 



WO 2004/046386 



PCT/US2003/036777 



6/58 



39740-0005 PCT.TXT 

<210> 36 
<211> 73 
<212> DNA 

<213> Homo sapiens 
<400> 36 

cacaatggcg gctctgaaga gttggctgtc gcgcagcgta acttcattct tcaggtacag 60 
acagtgtttg tgt 73 

<210> 37 

<211> 87 

<212> DNA 

<213> Homo sapiens 

<400> 37 

aggacgcaag gagggtttgt cactggcaga ctcgagactg taggcactgc catggcccct 60 
gtgctcagta aggactcggc ggacatc 87 

<210> 38 

<211> 84 

<212> DNA 

<213> Homo sapiens 

<400> 38 

ctctgagaca gtgcttcgat gactttgcag acttggtgcc ctttgactcc tgggagccgc 60 
tcatgaggaa gttgggcctc atgg ' 84 

<210> 39 
<211> 73 
<212> DNA 

<213> Homo sapiens 
<400> 39 

tgccacctgg acatcatttg ggtcaacact cccgagcacg ttgttccgta tggacttgga 60 
agccctaggt cca 73 

<210> 40 

<211> 62 

<212> DNA 

<213> Homo sapiens 

<400> 40 

tgtcgatgga cttccagaac cacctgggca gctgccaaaa gtgtgatcca agctgtccca 60 
at 62 

<210> 41 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gagtcgggct ctggaggaaa agaaaggtaa ttatgtggtg acagatcacg gctcgtgcgt 60 
ccgagcctgt gg 72 

<210> 42 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 42 

gatctaagat ggcgactgtc gaaccggaaa ccacccctac tcctaatccc ccgactacag 60 
aagaggagaa aacggaatct aa 82 

<210> 43 

<211> 81 

<212> DNA 

<213> Homo sapiens 
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<400> 43 

cggttatgtc atgccagata cacacctcaa aggtactccc tcctcccggg aaggcaccct 60 
ttcttcagtg ggtctcagtt c ~~ 81 

<210> 44 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 44 

tggctcttaa tcagtttcgt tacctgcctc tggagaattt acgcattatt cgtgggacaa 60 
aactttatga ggatcgatat gccttg ~~ 86 

<210> 45 
<211> 91 
<212> DNA 

<213> Homo sapiens 
<400> 45 

ataacaaagt gtagctctga catgaatggc tattgtttgc atggacagtg catctatctg 60 
gtggacatga gtcaaaacta ctgcaggtgt g 91 

<210> 46 
<211> 67 
<212> DNA 

<213> Homo sapiens 
<400> 46 

acggatcaca gtggaggaag cgctggctca cccctacctg gagcagtact atgacccgac 60 
ggatgag 67 

<210> 47 
<211> 91 
<212> DNA 

<213> Homo sapiens 
<400> 47 

ggattgctca acaaccatgc tgggcatctg gaccctccta cctctggttc ttacgtctgt 60 
tgctagatta tcgtccaaaa gtgttaatgc c 91 

<210> 48 
<211> 75 
<212> DNA 

<213> Homo sapiens 
<400> 48 

ttggtacctg tgggttagca tcaagttctc cccagggtag aattcaatca gagctccagt 60 
ttgcatttgg atgtg 75 

<210> 49 
<211> 68 
<212> DNA 

<213> Homo sapiens 
<400> 49 

tgatgcgcct ggaaacagtc agcaggcaac tccgaaggac aacgagataa gcacctttca 60 
caacctcg 68 

<210> 50 
<211> 73 
<212> DNA 

<213> Homo sapiens 
<400> 50 

cgaaaagatg ctgaacagtg acaaatccaa ctgaccagaa gggaggagga agctcactgg 60 
tggctgttcc tga 73 

<210> 51 
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<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 51 

cccactcagt agccaagtca caatgtttgg aaaacagccc gtttacttga gcaagactga 60 
taccacctgc gtg 73 

<210> 52 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gactccttcg tccccagttg ccgtctagga ttgggcctcc cataattgct ttgccaaaat 60 
accagagcct tcaagtgcca 80 

<210> 53 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 53 

cggtgtgaga agtgcagcaa gccctgtgcc cgagtgtgct atggtctggg catggagcac 60 
ttgcgagagg 70 

<210> 54 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccgaaatcca gatgatgatg ctcatggacc ctggtgctac acgggaaatc cactcattcc 60 
ttggg 65 

<210> 55 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 55 

agaaccgcaa ggtgagcaag gtggagattc tccagcacgt catcgactac atcagggacc 60 
ttcagttgga 70 

<210> 56 
<211> 83 
<212> DNA 

<213> Homo sapiens 
<400> 56 

gcatggtagc cgaagatttc acagtcaaaa tcggagattt tggtatgacg cgagatatct 60 
atgagacaga ctattaccgg aaa 83 

<210> 57 

<211> 68 

<212> DNA 

<213> Homo sapiens 

<400> 57 

acgcaccggg tgtctgatcc caagttccac cccctccatt caaagataat catcatcaag 60 
aaagggca ^ 68 

<210> 58 
<211> 74 
<212> DNA 

<213> Homo sapiens 
<400> 58 
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ccacagctca ccttctgtca ggtgtccatc ccagctccag ccagctccca gagaggaaga 60 
gactggcact gagg 74 

<210> 59 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 59 

ccatgatcct cactctgctg gtggactata cactccagac ctcgcttagc atggtaaatc 60 
accggctaca aagcttc 77 

<210> 60 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 60 

accgggagcc ctacatgacc gaaaatacct gcaaccgtta ctgccgtgac gagattgagt 60 
cagtgaaaga gcttaagg 78 

<210> 61 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 61 

cgaggattgg ttcttcagca agacagagga actgaaccgc gaggtggcca ccaacagtga 60 
gctggtgcag agt 73 

<210> 62 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 62 t CA 

actcaagcgg aaattgaagc agataggtct tatcagcaca gtctccgcct cctggattca 60 

gtgtctcggc ttcagggagt 80 

<210> 63 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 63 CA 
ccgccctcac ctgaagagaa acgcgctcct tggcggacac tgggggagga gaggaagaag 60 

cgcggctaac ttattcc ' 1 

<210> 64 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 64 a 
tgagcgtcgc agaaaccaca acatcctgga gcgccagcgc cgcaacgacc ttcggtccag 60 

ctttctcacg ctcaggga 7 ° 

<210> 65 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 65 ^ cn 

gcggaaggtc cctcagacat ccccgattga aagaaccaga gaggctctga gaaacctcgg 60 

gaaacttaga 70 

<210> 66 
<211> 66 
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<212> DNA 

<213> Homo sapiens 

<400> 66 

cggtggacca cgaagagtta acccgggact tggagaagca ctgcagagac atggaagagg 60 
cgagcc 66 

<210> 67 

<211> 68 

<212> DNA 

<213> Homo sapiens 

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt 60 
gtcccggg 68 

<210> 68 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 68 , „ 

ccgcaacgtg gttttctcac cctatggggt ggcctcggtg ttggccatgc tccagctgac 60 
aacaggagga gaaacccagc a 81 

<210> 69 

<211> 67 

<212> DNA 

<213> Homo sapiens 

ttgttggtgt gccctggtgc cgtggtggcg gtcactccct ctgctgccag tgtttggaca 60 
gaaccca ~ 57 

<210> 70 

<211> 62 

<212> DNA 

<213> Homo sapiens 

actgaaggag acccttggag cctaggggca tcggcaggag agtgtgtggg cagggttatt 60 



ta ' 62 

<210> 71 

<211> 74 

<212> DNA 

<213> Homo sapiens 

agagccagtt gctgtagaac tcaaatctct gctgggcaag gatgttctgt tcttgaagga 60 
ctgtgtaggc ccag /4 

<210> 72 

<211> 76 

<212> DNA 

<213> Homo sapiens 

cccatggatg ctcctctgaa gagactttcc tcattgactg ccgaggcccc atgaatcaat 60 
gtctggtagc caccgg 7b 

<210> 73 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 73 

tgactttatg gagcccaagt ttgagtttgc tgtgaagttc aatgcactgg aattagatga 60 
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cagcgacttg gc 72 

<210> 74 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 74 

cgcttgccta actcatactt tcccgttgac acttgatcca cgcagcgtgg cactgggacg 60 
taagtggcgc agtctgaatg g 81 

<210> 75 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 75 ^ 
tccttcagct ttcacactgg gctcagaaat gaagttgcat gactcttctg gaagtcaggt 60 

gggaacagga ttt 73 

<210> 76 

<211> 69 

<212> DNA 

<213> Homo sapiens 

<400> 76 _ ~ n 

agtgggagac acctgacctt tctcaagctg agattgagca gaagatcaag gagtacaatg 60 

cccagatca 

<210> 77 

<211> 77 

<212> DNA 

<213> Homo sapiens 

cgaagccctt acaagtttcc tagttcaccc ttacggattc ctggagggaa catctatatt 60 
tcacccctga agagtcc 77 

<210> 78 

<211> 74 

<212> DNA 

<213> Homo sapiens 

ccagacgagc gattagaagc ggcagcttgt gaggtgaatg atttggggga agaggaggag 60 
gaggaagagg agga 

<210> 79 

<211> 75 

<212> DNA 

<213> Homo sapiens 

ccattctatc atcaacgggt acaaacgagt cctggccttg tctgtggaga cggattacac 60 
cttcccactt gctga 75 

<210> 80 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 80 , , CA 

tgtggcaagt gcaaatgtaa ggagtgcacc tacccaaggc ctctgccatc agactggatc 60 

tgcgac bb 

<210> 81 
<211> 64 
<212> DNA 
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cctgaacatg aaggagctga agctgctgca gaccatcggg aagggggagt tcggagacgt 60 
gatg b4 

<210> 82 

<211> 69 

<212> DNA 

<213> Homo sapiens 

catcttccag gaggaccact ctctgtggca ccctggacta cctgccccct gaaatgattg 60 
aaggtcgga by 

<210> 83 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 83 ^ . an 

tgttttgatt cccgggctta ccaggtgaga agtgagggag gaagaaggca gtgtcccttt bU 

tgctagagct gacagctttg <* u 

<210> 84 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 84 „ 
aagaggaacg gagcgagtcc ccgcgcgcgg cgcgattccc tgagctgtgg gacgtgcacc 60 

caggactcgg ctcacacat /y 

<210> 85 

<211> 68 

<212> DNA 

<213> Homo sapiens 

gccaactgct ttcatttgtg agggatctga accaatacag agcagacata aaggaaatgg 60 
gcctgagt b5 

<210> 86 

<211> 66 

<212> DNA 

<213> Homo sapiens 

aacaccaatg ggttccatct ttctgggctc ctgattgctc aagcacagtt tggcctgatg 60 
aagagg 

<210> 87 

<211> 69 

<212> DNA 

<213> Homo sapiens 

tcaccctctg tgacttcatc gtgccctggg acaccctgag caccacccag aagaagagcc 60 
tgaaccaca 

<210> 88 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 88 rr\ 

cgactccgtt ctcagtgtct gacatcttga gtcccctgga ggaaagctac aagaaagtgg 60 
gcatggaggg 
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<210> 89 
<211> 74 
<212> DNA 

<213> Homo sapiens 
<400> 89 

tgctgttgct gagtctgttg ccagtcccca gaagaccatg tctgtgttga gctgtatctg 60 
tgaagccagg caag 74 

<210> 90 
<211> 70 
<212> DNA 

<213> Homo sapiens 
<400> 90 

gtggatgtgc cctgaaggac aagccaggcg tctacacgag agtctcacac ttcttaccct 60 
ggatccgcag 70 

<210> 91 
<211> 83 
<212> DNA 

<213> Homo sapiens 
<400> 91 

cctcagcaag acgttatttg aaattacagt gcctctctct caaggcccca aaccagtaac 60 
aatcagtttt gccaatcaca ctt S3 

<210> 92 

<211> 77 

<212> DNA 

<213> Homo sapiens 

gcagttggaa gacacaggaa agtatcccca aattgcagat ttatcaacgg cttttatctt 60 
gaaaatagtg ccacgca 77 

<210> 93 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 93 

agactgtgga gtttgatgtt gttgaaggag aaaagggtgc ggaggcagca aatgttacag 60 



gtcctggtgg tgttcc 

<210> 94 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 94 

cgttccgatc ctctatactg cat 

<210> 95 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 95 

aggtccctgt tggccttata gg 



76 



23 



22 
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<210> 96 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 96 

atgcctacag caccctgatg tcgca 

<210> 97 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 97 

cgcttctatg gcgctgagat 

<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 98 

tcccggtaca ccacgttctt 

<210> 99 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 99 

cagccctgga ctacctgcac tcgg 

<210> 100 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 100 

tcctgccacc cttcaaacc 

<210> 101 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 101 

ggcggtaaat tcatcatcga a 

<210> 102 
<211> 24 
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25 



20 



20 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 102 

caggtcacgt ccgaggtcga caca 

<210> 103 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 103 

ggacagcagg aatgtgtttc 

<210> 104 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 104 

acccactcga tttgtttctg 

<210> 105 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 105 

cattggctcc ccgtgacctg ta 

<210> 106 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 106 

ggctcttgtg cgtactgtcc tt 

<210> 107 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 107 

tcagatgacg aagagcacag atg 

<210> 108 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 
<400> 108 

aggctcagtg atgtcttccc tgtcaccag 29 

<210> 109 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 109 on 
ccattcccac cattctacct 20 

<210> 110 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 110 

gggaacatag acccaccaat ^ 

<210> 111 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 111 

acaccccaga cgtcctggcc t ^-L 

<210> 112 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 112 

ccgccgtgga cacagact J-<* 

<210> 113 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 113 

ttgccgtcag aaaacatgtc a ^! 

<210> 114 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer 
<400> 114 

tgccactcgg aaaaagacct ctcgg 

<210> 115 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 115 

cttttgtgga actctatggg aaca 

<210> 116 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 116 

cagcggttga agcgttcct 

<210> 117 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 117 

ttcggctctc ggctgctgca 

<210> 118 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 118 

ccttccgacc agcagatgaa 

<210> 119 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 119 

tttatatgca cattgggagc tgat 

<210> 120 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<400> 120 

caatttgggc aacgagaccg atcct 

<210> 121 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 121 

gtgcaggaaa ggttcacaaa 

<210> 122 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 122 

gcacacacga tggagtaagg 

<210> 123 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 123 

agtgtctgcg tccaatacac gcgt 

<210> 124 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 124 

agggagatgc cgcttcgt 

<210> 125 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 125 

ctctcacacc ttgctccaat gta 

<210> 126 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 126 

ccttcatcac agacacagga gggcg 
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<210> 127 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 127 

atcctagccc tggtttttgg 

<210> 128 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 128 

ctgccttctc atctgcacaa 

<210> 129 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 129 

tttgctgtca ccagcgtcgc 

<210> 130 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 130 

gaaggccaag aaccgagtca 

<210> 131 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 131 

tccccagtta gttcaaaagt caca 

<210> 132 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 132 

ttatattcca gtttaaggcc aatcctc 
<210> 133 
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<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 133 

ccatacctca agtatttgcc atcag 25 

<210> 134 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 134 

agctttgtcc cgtgactgtg ta 22 

<210> 135 
<211> 29 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 135 

attgctggag ctgcctttca tttagcact 29 

<210> 136 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 136 . 7 
cctctgtgct acagattata cctttgc 

<210> 137 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 137 

cactgcagcc ccaatgct 15 

<210> 138 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 138 

tacccgccat ccatgatcgc ca 22 

<210> 139 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 
<400> 139 

aaagaagatg atgaccgggt ttac ^4 

<210> 140 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 140 

gagcctctgg atggtgcaat M 

<210> 141 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 141 

caaactcaac gtgcaagcct cgga ^ 

<210> 142 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 142 

atgctgtggc tccttcctaa ct " 

<210> 143 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 143 „ 
acccaaattg tgatatacaa aaaggtt £-< 

<210> 144 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 144 

taccaagcaa cctacatgtc aagaaagccc w 

<210> 145 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 

<400> 145 ~ A 
gcaggtgtca gcaagtatga tcag ^ 

<210> 146 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 146 9n 
tttttccgct gtggtgatga zu 

<210> 147 
<2U> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 147 ~ 7 
cgacaggata ttgaccaccg cctcatt z/ 

<210> 148 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 148 1R 
gcccagtgcg gagaacag xo 

<210> 149 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 149 71 
aatcacacgc acctggagaa c ZJ - 

<210> 150 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 150 - n 
ccagcttgat tctcgtctct gcacttaagc w 

<210> 151 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<400> 151 

atcaccgaca gcacagaca 

<210> 152 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 152 

acctgtgttt ggatttgcag 

<210> 153 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 153 

ccctgctacc aatatggact ccagtca 

<210> 154 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 154 

gacgaagaca gtccctggat 

<210> 155 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 155 

actggggtgg aatgtgtctt 

<210> 156 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 156 

caccgacagc acagacagaa tccc 

<210> 157 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 157 
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cacacaaaac agaaccagga ct 

<210> 158 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 158 

ctgaagtagc acttccggat t 

<210> 159 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 159 

acccagtgga acccaagcca ttc 

<210> 160 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 160 

ctcataccag ccatccaatg 

<210> 161 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 161 

ttgggttgaa gaaatcagtc c 

<210> 162 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 162 

caccaagccc agaggacagt tcct 

<210> 163 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 163 

tggttcccag ccctgtgt 
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<210> 164 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 164 

ctcctccacc ctgggttgt 

<210> 165 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 165 

ctccaagccc agattcagat tcgagtca 

<210> 166 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 166 

gtgcaggctc aggtgaagtg 

<210> 167 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 167 

gacctcaggg cgattcatga 

<210> 168 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 168 

tcagcttcta caactggaca gacaacgctg 

<210> 169 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 169 

gggcgtggaa cagtttatct 

<210> 170 
<211> 19 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 170 

cacggtgaag gtttcgagt 

<210> 171 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 171 

agacatctgc cccaagaagg acgt 

<210> 172 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 172 

aaacgagcag tttgccatca g 

<210> 173 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 173 

gttggtgatg ttccgaagca 

<210> 174 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 174 

cctcaccggc atagactgga agcg 

<210> 175 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 175 

cacagcctca cttctaacct tctg 

<210> 176 
<211> 22 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> primer 
<400> 176 

ttgaatggcg tggattcaat ag 

<210> 177 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 177 

acccacccac cactgccaag etc 

<210> 178 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 178 

ctgaaggagc tccaagacct 

<210> 179 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 179 

caaaaccgct gtgtttcttc 

<210> 180 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 180 

tgctgatgtg ccctctcctt gg 

<210> 181 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 181 

atgtggaacc cccacctact t 

<210> 182 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer 
<400> 182 

cagtccacag cacggttata cc 22 

<210> 183 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 183 

agtcccaaca gaaacaagaa cttcaggcg 29 

<210> 184 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 184 ^ 
gacatttcca gtcctgcagt ca 22 

<210> 185 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 185 „ 
ctccgatcgc acacatttgt ^ u 

<210> 186 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 186 

tgcctctctg ccccaccctt tgt 25 

<210> 187 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 187 

tctgcagagt tggaagcact eta " 

<210> 188 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



WO 2004/046386 PCT/US2003/036777 

29/58 

39740-0005 PCT.TXT 

<400> 188 

gccgaggctt ttctaccaga a ^J- 

<210> 189 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 189 OQ 
caggatacag ctccacagca tcgatgtc " 

<210> 190 
<2U> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 190 

gggtctgtgc cccatgac 

<210> 191 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 



20 



<400> 191 

tgaccgtgcc agcatttaca 

<210> 192 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 192 „ 
cctggctgcc caagaagtgt tccct « 

<210> 193 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 193 „ 
gggaggctta tctcactgag tga « 

<210> 194 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 194 

ccattgcagc cttcattgc 
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<210> 195 
<211> 29 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 195 - Q 
ttgaggccca gagcagtcta ccagattct ^ 

<210> 196 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 196 7n 
gaccaaggtc ctggaatgtc ^ u 

<210> 197 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 197 ?n 
gtcttccctg tacccgtagg 

<210> 198 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 198 7 a 
caggatgcca ttcaccttct gctg 

<210> 199 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 199 iq 
cacaatggcg gctctgaag 

<210> 200 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 200 9 ^ 
acacaaacac tgtctgtacc tgaaga ^ 

<210> 201 
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<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 201 

aagttacgct gcgcgacagc caa 

<210> 202 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 202 

aggacgcaag gagggtttg 

<210> 203 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 203 

gatgtccgcc gagtccttac t 

<210> 204 
<211> 29 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 204 

cagtgcctac agtctcgagt ctgccagtg 

<210> 205 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 205 

ctctgagaca gtgcttcgat gact 

<210> 206 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 206 

ccatgaggcc caacttcct 

<210> 207 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 
<400> 207 

cagacttggt gccctttgac tec 

<210> 208 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 208 

tgccacctgg acatcatttg 

<210> 209 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 209 

tggacctagg gcttccaagt c 

<210> 210 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 210 

cactcccgag cacgttgttc cgt 

<210> 211 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 211 

tgtcgatgga cttccagaac 

<210> 212 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 212 

attgggacag cttggatca 

<210> 213 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 

<400> 213 1ft 
cacctgggca gctgccaa ±* 

<210> 214 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 214 „ 
gagtcgggct ctggaggaaa ag " 

<210> 215 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 215 1R 
ccacaggctc ggacgcac ±0 

<210> 216 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 216 79l 
agccgtgatc tgtcaccaca taattacc " 

<210> 217 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 217 „ 
gatctaagat ggcgactgtc gaa 

<210> 218 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 218 7 c 
ttagattccg ttttctcctc ttctg « 

<210> 219 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
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<400> 219 

accaccccta ctcctaatcc cccgact 

<210> 220 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 220 

cggttatgtc atgccagata cac 

<210> 221 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 221 

gaactgagac ccactgaaga aagg 

<210> 222 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 222 

cctcaaaggt actccctcct cccgg 

<210> 223 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 223 

tggctcttaa tcagtttcgt tacct 

<210> 224 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 224 

caaggcatat cgatcctcat aaagt 

<210> 225 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 225 
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tgtcccacga ataatgcgta aattctccag 30 

<210> 226 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 226 ~ q 

ataacaaagt gtagctctga catgaatg " 

<210> 227 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 227 0A 
cacacctgca gtagttttga ctca ^ 

<210> 228 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 228 ™ 
ttgtttgcat ggacagtgca tctatctggt *v 

<210> 229 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 229 7n 
acggatcaca gtggaggaag zu 

<210> 230 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 230 9n 
ctcatccgtc gggtcatagt ^ u 

<210> 231 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 231 ?n 
cgctggctca cccctacctg CK} 
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<210> 232 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 232 

ggattgctca acaaccatgc t ^-L 

<210> 233 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 233 - A 
ggcattaaca cttttggacg ataa 

<210> 234 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 234 - n 
tctggaccct cctacctctg gttcttacgt ^ 

<210> 235 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 235 7n 
ttggtacctg tgggttagca ^ 

<210> 236 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 236 ?n 
cacatccaaa tgcaaactgg cyj 

<210> 237 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 237 oa 
tccccagggt agaattcaat cagagc ^° 

<210> 238 
<211> 19 
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<212> DNA 
<213> Artificial Sequence 

<220> 

<223> primer 
<400> 238 

tgatgcgcct ggaaacagt 

<210> 239 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 239 

cgaggttgtg aaaggtgctt ate 

<210> 240 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 240 

agcaggcaac tccgaaggac aacg 

<210> 241 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 241 

cgaaaagatg ctgaacagtg aca 

<210> 242 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 242 

tcaggaacag ccaccagtga 

<210> 243 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 243 

cttcctcctc ccttctggtc agttggat 

<210> 244 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 



<400> 244 

cccactcagt agccaagtca ^ u 

<210> 245 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 245 „ 
cacgcaggtg gtatcagtct ^ u 

<210> 246 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 246 77 
tcaagtaaac gggctgtttt ccaaaca 

<210> 247 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 



20 



<400> 247 

gactccttcg tccccagttg 

<210> 248 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 248 91 
tggcacttga aggctctggt a ^ x 

<210> 249 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 249 9C - 
ttgggcctcc cataattgct ttgcc ^ 

<210> 250 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



WO 2004/046386 



<223> primer 
<400> 250 

cggtgtgaga agtgcagcaa 

<210> 251 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 251 

cctctcgcaa gtgctccat 

<210> 252 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 252 

ccagaccata gcacactcgg gcac 

<210> 253 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 253 

ccgaaatcca gatgatgatg 

<210> 254 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 254 

cccaaggaat gagtggattt 

<210> 255 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 255 

ctcatggacc ctggtgctac acg 

<210> 256 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
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<400> 256 1Q 
agaaccgcaa ggtgagcaa ^ 

<210> 257 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 257 91 
tccaactgaa ggtccctgat g ZJ - 

<210> 258 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 258 7 ^ 
tggagattct ccagcacgtc atcgac 

<210> 259 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 259 71 
gcatggtagc cgaagatttc a ^ x 

<210> 260 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 260 ™ 
tttccggtaa tagtctgtct catagatatc ™ 

<210> 261 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 261 ™ 
cgcgtcatac caaaatctcc gattttga ^° 

<210> 262 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 262 17 
acgcaccggg tgtctga ^ 
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<210> 263 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 263 

tgccctttct tgatgatgat tatc 

<210> 264 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 264 

cccaagttcc accccctcca ttca 

<210> 265 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 265 

ccacagctca ccttctgtca 

<210> 266 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 266 

cctcagtgcc agtctcttcc 

<210> 267 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 267 

tccatcccag ctccagccag 

<210> 268 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 268 

ccatgatcct cactctgctg 
<210> 269 
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<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 269 9n 
gaagctttgt agccggtgat ^ 

<210> 270 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 270 

cactccagac ctcgcttagc atgg ^ 

<210> 271 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 271 1Q 
accgggagcc ctacatgac ±y 

<210> 272 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 272 17 
ccttaagctc tttcactgac tcaatct 

<210> 273 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 273 7Q 
aaatacctgc aaccgttact gccgtgac ^ 

<210> 274 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 274 71 
cgaggattgg ttcttcagca a ZJ - 

<210> 275 
<211> 22 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 
<400> 275 

actctgcacc agctcactgt tg 

<210> 276 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 276 

cacctcgcgg ttcagttcct ctgt 

<210> 277 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 277 

actcaagcgg aaattgaagc a 

<210> 278 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 278 

actccctgaa gccgagacac t 

<210> 279 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 279 

aggtcttatc agcacagtct ccgcctcc 

<210> 280 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 280 

ccgccctcac ctgaagaga 

<210> 281 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 
<400> 281 

ggaataagtt agccgcgctt ct 

<210> 282 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 282 

cccagtgtcc gccaaggagc g 

<210> 283 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 283 

tgagcgtcgc agaaacca 

<210> 284 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 284 

tccctgagcg tgagaaagct 

<210> 285 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 285 

ccagcgccgc aacgaccttc 

<210> 286 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 286 

gcggaaggtc cctcagaca 

<210> 287 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<400> 287 

tctaagtttc ccgaggtttc tea 

<210> 288 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 288 

ccccgattga aagaaccaga gaggct 

<210> 289 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 289 

cggtggacca cgaagagtta a 

<210> 290 
<211> 19 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> primer 
<400> 290 

ggctcgcctc ttccatgtc 

<210> 291 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 291 

ccgggacttg gagaagcact gca 

<210> 292 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 292 

ctttgaaccc ttgcttgcaa 

<210> 293 
<2U> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 293 
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cccgggacaa agcaaatg 

<210> 294 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 294 

aagtcctggg tgcttctgac gcaca 

<210> 295 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 295 

ccgcaacgtg gttttctca 

<210> 296 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 296 

tgctgggttt ctcctcctgt t 

<210> 297 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 297 

ctcggtgttg gccatgctcc ag 

<210> 298 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 298 

ttgttggtgt gccctggtg 

<210> 299 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 299 

tgggttctgt ccaaacactg g 
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<210> 300 
<2U> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 300 

tggtggcggt cactccctct gc 

<210> 301 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 301 

actgaaggag acccttggag 

<210> 302 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 302 

taaataaccc tgcccacaca 

<210> 303 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 303 

tctcctgccg atgcccctag g 

<210> 304 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 304 

agagccagtt gctgtagaac tcaa 

<210> 305 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 305 

ctgggcctac acagtccttc a 

<210> 306 
<211> 27 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 306 „ 
tctctgctgg gcaaggatgt tctgttc 

<210> 307 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 311 

gccaagtcgc tgtcatctaa 

<210> 312 
<211> 25 
<212> DNA 

<213> Artificial sequence 
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<400> 307 

cccatggatg ctcctctgaa 

<210> 308 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 308 „ 
ccggtggcta ccagacattg zu 

<210> 309 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 309 „ 
cattgactgc cgaggcccca tg zz 

<210> 310 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 310 91 
tgactttatg gagcccaagt t zx 

<210> 311 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<220> 

<223> primer 
<400> 312 

ttccagtgca ttgaacttca cagca 

<210> 313 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 313 

cgcttgccta actcatactt tec 

<210> 314 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 314 

ccattcagac tgcgccactt 

<210> 315 
<211> 19 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> primer 
<400> 315 

tccacgcagc gtggcactg 

<210> 316 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 316 

tccttcagct ttcacactgg 

<210> 317 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 317 

aaatcctgtt cccacctgac 

<210> 318 
<211> 29 
<212> DNA 

<213> Artificial sequence 
<220> 
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<223> primer 

<400> 318 0Q 
tccagaagag tcatgcaact tcatttctg zy 

<210> 319 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 319 9n 
agtgggagac acctgacctt zu 

<210> 320 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 320 7Ci 
tgatctgggc attgtactcc ^ 

<210> 321 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 321 2 o 
ttgatcttct gctcaatctc agcttgaga 

<210> 322 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 322 20 
cgaagccctt acaagtttcc 

<210> 323 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 323 20 
ggactcttca ggggtgaaat 

<210> 324 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<400> 324 

cccttacgga ttcctggagg gaac 

<210> 325 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 325 

ccagacgagc gattagaagc 

<210> 326 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 326 

tcctcctctt cctcctcctc 

<210> 327 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 327 

tgtgaggtga atgatttggg gga 

<210> 328 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 328 

ccattctatc atcaacgggt acaa 

<210> 329 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 329 

tcagcaagtg ggaaggtgta ate 

<210> 330 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 330 

tctccacaga caaggecagg actcg 
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<210> 331 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 331 „ 
tgtggcaagt gcaaatgtaa ^ 

<210> 332 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 332 

gtcgcagatc cagtctgatg zu 

<210> 333 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 333 ~ A 
cagaggcctt gggtaggtgc actc ^ 

<210> 334 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 334 An 
cctgaacatg aaggagctga cctgaacatg aaggagctga 4U 

<210> 335 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 335 1Q 
catcacgtct ccgaactcc " 

<210> 336 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 336 01 
tcccgatggt ctgcagcagc t ^ 

<210> 337 
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<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 337 7n 
catcttccag gaggaccact zu 

<210> 338 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 338 9n 
tccgaccttc aatcatttca zu 

<210> 339 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 339 0A 
ctctgtggca ccctggacta cctg Z£f 

<210> 340 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 340 ?n 
tgttttgatt cccgggctta ^ 

<210> 341 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 341 7A 
caaagctgtc agctctagca aaag 

<210> 342 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 342 7R 
tgccttcttc ctccctcact tctcacct ^ 

<210> 343 
<211> 19 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 
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<400> 343 1Q 
aagaggaacg gagcgagtc ±y 

<210> 344 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 344 1Q 
atgtgtgagc cgagtcctg 

<210> 345 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 345 „ 
cacgtcccac agctcaggga ate 

<210> 346 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 346 7a 
gccaactgct ttcatttgtg 

<210> 347 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 347 20 
actcaggccc atttccttta 

<210> 348 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 348 7R 
agggatctga accaatacag agcagaca ^° 

<210> 349 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 
<400> 349 

aacaccaatg ggttccatct 

<210> 350 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 350 

cctcttcatc aggccaaact 

<210> 351 
<2U> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 351 

ttctgggctc ctgattgctc aagc 

<210> 352 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 352 

tcaccctctg tgacttcatc gt 

<210> 353 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 353 

tgtggttcag gctcttcttc tg 

<210> 354 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 354 

ccctgggaca ccctgagcac ca 

<210> 355 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
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<400> 355 

cgactccgtt ctcagtgtct ga 

<210> 356 
<211> 19 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> primer 
<400> 356 

ccctccatgc ccactttct 

<210> 357 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 357 

atcttgagtc ccctggagga aagc 

<210> 358 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 358 

tgctgttgct gagtctgttg 

<210> 359 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 359 

cttgcctggc ttcacagata 

<210> 360 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 360 

ccagtcccca gaagaccatg tctg 

<210> 361 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 361 
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gtggatgtgc cctgaagga 19 

<210> 362 
<211> 20 
<212> DNA 

<213> Arti f i ci al Sequence 
<220> 

<223> primer 
<400> 362 

ctgcggatcc agggtaagaa ^ u 

<210> 363 
<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 363 0Q 
aagccaggcg tctacacgag agtctcac " 

<210> 364 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 364 „ 
cctcagcaag acgttatttg aaatt ZD 

<210> 365 
<211> 24 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> primer 

<400> 365 7A 
aagtgtgatt ggcaaaactg attg 

<210> 366 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 366 ja 
cctctctctc aaggccccaa accagt 

<210> 367 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 367 „ 
gcagttggaa gacacaggaa agt " 
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<210> 368 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 368 

tgcgtggcac tattttcaag a 

<210> 369 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 369 

tccccaaatt gcagatttat caacggc 

<210> 370 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 370 

agactgtgga gtttgatgtt gttga 

<210> 371 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 371 

ggaacaccac caggacctgt aa 

<210> 372 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 
<400> 372 

ttgctgcctc cgcacccttt tct 
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